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]. Basis of the report 



1 . With regard to the elements of the international application: * 
I X I the international application as originally filed 
the description: 



pages 
pages 
pages 



1-29 



NONE 



_ , as originally filed 
filed with the demand 



NONE 



, filed with the letter of 



the claims: 

pages 

pages 

pages 

pages 



30-37 



NONE 



NONE 



, as originally filed 

as amended (together with any statement) under Article 19 

, filed with the demand 



NONE 



, filed with the letter of 



the drawings: 

pages 

pages 

pages 



1-8 



NONE 



, as originally filed 

, filed with the demand 



NONE 



I x| the sequence listing part of the description: 
pages NONE 



, filed with the letter of . 



pages 
pages 



NONE 



, as originally filed 

, filed with the demand 



NONE 



, filed with the letter of , 



2. With regard to the language, all the elements marked above were available or furnished to this Authority in the language in which 
the international apphcation was filed, unless otherwise indicated under this item. 

Tliese elements were available or furnished to this Authority in the following language which is: 

I I the language of a translation furnished for the purposes of international search (under Rule 23.1(b)). 
I I the language of publication of the international application (under Rule 48.3(b)). 

[ I the language of the translation furnished for the purposes of international preliminary examination (under Rules 55.2 and/ 
or 55.3). 

3. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the international 
preliminary examination was carried out on the basis of the sequence listing: 

□ contained in the international application in printed form. 

I [ filed together with the international application in computer readable form. 

I I furnished subsequently to this Authority in written form. 

I I furnished subsequently to this Authority in computer readable form. 

□ The statement that the subseauently furnished yvritten sequence listing does not go beyond the disclosure in the 
intemational application as filed has been furnished. 

□ The statement that the information recorded in computer readable form is identical to the writen sequence listing has 
been furnished. 

4 I x[ The amendments have resulted in the cancellation of 

0 the description, pages NONE 

the claims, Nos. NONE 

\~x\ the drawings, sheets/fig NONE 



5 . I I This report has been drawn as if (some of) the amendments had not been made, since they have been considered to go 

beyond the disclosure as filed, as indicated in the Supplemental Box (Rule 70.2(c)).* ♦ 
* Replacement sheets which have been Jumished to the receiving Office in response to an invitation under Article 14 are referred to 
in this report as "originally filed" and are not annexed to this report since they do not contain amendments (Rules 70.16 
and 70.17). 

**Anv replacement sheet coruaining such amendments must be referred to under item 1 and annexed to this report. 
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III. Non-establishment of opinion with regard to novelty, inventive step and industrial applicability 



1 . The questions whether the claimed invention appears to be novel, to involve an inventive step (to be non obvious), or to be 
industrially applicable have not been and will not be examined in respect of; 

[ I the entire international application. 

fx] claims Nos. 5-6, 17-18, 24-43 



□ 



because: 

the said international application, or the said claim Nos. _ relate to the following subject matter which 
does not require international preliminary examination (specify). 



I I the description, claims or drawings (indicate particular elements below) or said claims Nos. _ are so 
unclear that no meaningful opinion could be formed (specify). 



I I the claims, or said claims Nos. _ are so inadequately supported by the description that no meaningful 
opinion could be formed. 

no international search report has been established for said claims Nos. 5-6. 17-18, 24-43 . 



2. A meaningful international preliminary examination cannot be carried out due to the failure of the nucleotide and/or amino acid 
sequence listing to comply with the standard provided for in Annex C of the Administrative Instructions: 

I I the written form has not been furnished or does not comply with the standard. 

□ the computer readable form has not been furnished or does not comply with the standard. 
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V. Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 



1 . slalement 

Novelty (N) 



Claims 1-4. 7-16. 19-23 



Claims NONE 



YES 
NO 



Inventive Step (IS) 



Claims NONE 



Claims 1-4, 7-16. 19-23 



YES 
NO 



Industrial Applicability (lA) 



Claims 1-4, 7-16, 19-23 



Claims NONE 



YES 
NO 



2. citations and explanations (Rule 70.7) 

Claims 1-4, 7-16, 19-23 lack an inventive step under PCT Article 33(3) as being obvioias over either Bimbaum (Journal of 
Virology, April 1994, Vol 68, No. 4, pages 2521-2528) or Clem et al (Science, vol. 254, November 1991, pages 13881390)in 
view of Zolotukin et al (US Patent 5,874,304). 

Bimbaum et al teach an apoptosis -inhibiting gene from s nuclear polyhedrolsis virus encoding a polypeptide with Cys/His 
sequence motifs. The reference teaches that two different baculovirus genes are known to be able to block apoptosis triggered 
upon infection of Spodoptera cells with p35 mutants of the insect baculovirus. the reference teaches that using a genetic 
complementation assay to identify additional genes which inhibit apoptosis during infection with p35 mutant, Bimbaum et al have 
isolated a gene OpNPV that was able to functionally substitute for AcMNPV p35. The reference teaches a genetic 
complementation assay to identify apoptosis blockin genes in which SF-21 cells were cotransfected with vAcAnh and the test 
baculovirus DNA and transfected cells monitored. 

Clem et al teach prevention of Apoptosis by a Baculovirus Gene during Infection of Insect cells. Clam et al have identif ied 
annihilator (vAcAnh) viral mutant was identified from expression vectors. The mutant caused premature death of Spodoptera 
fmgiperda cells. 

Either Bimbaum et al or Clem et al have taught the method of identification of the gene or the polypeptide involved 
in apoptosis using the a reporter gene expression. However. Zolotukhin et al teach humanized green fluorescent protein genes 
and methods of use of the protein in several assays. The reference teaches that the expression vectors may comprise a multiple 
cloning sites that is operatively positioned downstream from gfp gene sequence, and these vectors are useful in addition to the 
uses in creating C terminal fusion proteins by cloning a second protein encoding DNA segment into the multiple cloning site 
so that it is contiguous and in frame with the gfp sequence. The reference teaches that the recombinant host cells will express 
GFP to produce encoded GFP protein, preferably in amount sufficient to allow GFP detection by its fluorescence. The reference 
teaches that the expression vectors comprising a GFP (Continued on Supplemental Sheet.) 
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CLASSIFICATION: 

The International Patent Classification (IPC) and/or the National classification are as listed below: 
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V. 2. REASONED STATEMENTS - CITATIONS AND EXPLANATIONS (Continued): 

gene operatively linked to a selected gene, and the fusion protein being produced in amount sufficient to allow cell detection 
by detecting the green fluorescence of GFP. Thus, it would have been obvious to use the GFP protein as a reporter gene in the 
method of identifying the gene involved in the apoptosis. 



Claims 1-4, 7-16, 19-23 meet the criteria set out in PCT Article 33(2) and (4), because the prior art does not teach the method 
of identifying the a polypeptide which increases or decreases gene expression from a promoter using a GFP reporter gene 
operably linked to said promoter. 



NEW CITATIONS 



NONE 
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Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 



This ineniaiional report has na been established in respea of certain claims under Article 17(2)(a) for the following reasons: 
1. I I Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 



2. I I Claims Nos.: 

— because they relate to parts of the international application that do not comply with the prescribed requirements to such 
an extent that no meaningful international search can be carried out, specifically: 



3. I I Claims Nos.: 

because they are dependent claims and are na drafted in accordance with the second and third sentences of Rule 6.4<a). 



Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found mtiltiple inventions in this international application, as follows: 
Please See Extra Sheet. 



1 - [ I As all required additional search fees were timely paid by the applicant, this iiuemational search report covers all searchable 



claims 



2. I I As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 

of any additional fee. 

3 . I I As only some of the required additional search fees were timely paid by the applicant, this iiuemational search report covers 

only those claims for which fees were paid, specifically claims Nos.: 



4. [ x| No required additional search fees were timely paid by the applicant. Consequently, this international search report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 
1-4. 7-16. 19-23 



Remark on Protest | | The additional search fees were accompanied by the appiicaiu's protest. 

I I No protest accompanied the payment of additional search fees. 
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Expression Cloning of Oncogenes by Retroviral Transfer 

of cDNA Libraries 

IAN WHITBHEAD, HEATHER f ^ 

Depaimm of Ma^^ Uniymity of British Columbia, and Tory Fax Lahonitory, 

Bnash Columbm Cam^Agemy^^Vahcquver, British Cobwtbia, Canada V5Z 4E6 - 

/Received 23 JuM 1994/Rcturocd for modificatfo^ 

A cDNA UbiBiy toansfcr ^ystem based on retrxMral vectors has been developed for expression doniiui in 

tninsferSLe;;^;^^^^ 

lOO-foW higher than those acUeved by transfection and theiefore enables the transfer and functional scmXe 
^ ^^"^ ^ tattW application of retmvlral transfer of cDNA libraries, wSe^J^SSf 
cDNAs which hidoce oncogenic transformation of IWI 3T3 fibroblasts, as measured bv l^lj«^*or* 
"SS^Z^i^^^r"^^ transfbnnhig cDlS?25 ^^^lat^ ?tS 2 ^55 

^^^^J^^^ of these cDNAswere derived from kno^ oncogenes (n?AAfct amt^fS 
S^.nTfl'^^SI^:!?.^ ^ ^ cioned prevloos^y bun«^ SSe^^^^SE 

SnSrfirSJii^^^^^^ "^^^^ Pbosphollpase C^, and SpM pmtease hihlbtor 

^SiJS^tLI^^^^ "2^**^ orientation and therefore is likely to act as an faihlbltor of 

an endogenous transformatfon suppressor Seven novel cDNAs with tnmsfbrmhig activities. tochidlMdM^ for 

" > .^^t Retroviral transfer of libraries should be^mwally usd^ for donlMdSNA^^SiK 

confer selectable phenotypes on maiv different types of mamnu^^ J u»«nu lor aonmg cuw as wlilch 



Cellular oncogenes or proto-oncogenes can be cloned by 
selecting for their ability to confer the phenolype of deregu- 
lated growth on cells in which they are expressed. This was first 
done by transfecting fragmented genomic DNA into cell lines 
such as NIH 3T3 fibroblasts which are susceptible to single-hit 
ncogenic transformation (13, 15. 31, 36, 40, 46). Subsequently, 
stable transfection of these cells with cDNA h-braries in plas- 
mid or phage expression vectors has been used as an alterna- 
tive approach, to avoid the severe difficulties which have been 
encountered in recovering and analyzing oncogenes present in 
transfartcd genomic DNA (8, 9, 32-34). Despite the theoret- 
ical advantages of woridng with transfected cDNA e:q)r€ssion 
hl)raries, only a small fraction of the many currently known 
oncogenes have been cloned in this way. The most significant 
limitation in the use of cDNA horary transfer for cloning 
oncogenes has been the low efficiencies of cDNA transfer and 
expression which can be achieved by stable transfection meth- 
ods. It is difficult to generate more than a few tens of thousands 
of transfectants in which cDNA clones are being ejqiressed at 
adequate levels, but a comprehensive screening of a mamma- 
lian cDNA library demands the transfer and expression of 
several million clones. Therefore, the small number of onco- 
genes which have been cloned so far from transferred hbraries 
is unlikely to be due to exhaustion of the pool of cDNAs with 
oncogenic potential but rather is a result of a failure to trans- 
fer, and thus detect, most such cDNAs. 

In contrast to deh*berately constructed cDNA libraries, nat- 
ural populations of retroviruses have served as a prolific source 
of oncogenes, through their ability to incorporate and muta- 
tionally activate cDNA copies of host cell mRNAs and then to 
transfer them to recipient cells by infection (5). Artificial ret- 
roviral vectors with equivalent abilities to transfer and express 



• Corresponding author. Mailing address: Teny Fox Laboratory BC 
Cancer Agency. 601 W. 10th Ave., Vancouver, BQ Canada V5Z 4E6 
Phone: (604) 877^0. Fax: (604) 877^12. Electronic mail address* 
robeTt@tenyfox.ubc.ca. 



cDNAs in manmialian cells (35) could be ideally suited to 
SCTcening hT)raries for cDNAs conferring selectable pheno- 
types such as deregulated ceU growth. Previous reports have 
mdicated the potential for using retroviruses.to clone cDNAs 
conferring selectable phenotypes, as demonstrated by the iso- 
lation of cDNAs encoding thymidine kinase (TK) via rescue of 
TK* fibroblasts (39) and cDNAs encoding interleukin-3 and 
^OTulocyte-macrophagc colony-stimulating factor via conver- 
sion of hemopoietic cell lines to pytokine-independent growth 
(42). This report describes the development of an efficient 
system for cDNA horary transfer and expression by retroviral 
vectors and its application to the cloning of a large number of 
cDNAs which induce oncogenic transformation of murine fi- 
broblasts. 



MATERIALS AND METHODS 

pCTV n^on. pCTVl was constructed from the followtng elements: (0 an 
iJ'i^r' COTtoining the U3 regkm and part of the R rcgiJm of 

1^^%???^ J^^i^?^ (MMLVX e«ised from the JZen Xviral 
^i^k^ir^^^^^ fragnwnt extending from the R region to base 1035 
I!!, ^^l^ gtnomic sequence (GenBank accession number J02255), derived 
by PCR from the pBabeHygro retroviral vector (38); (iii) an JOtol-Clal fragment 

expression vector (24); (iv) a Oal-BamHl fragment from 
a wbctonc containing the myeloproUferative sarcoma vinis (MPSVVMMLV 
hybrid 3' long tenninal repeat (LTR) from JZcn akmg with UO bp of unchar- 
actenzcd genonuc DMA sequence downstream of the LTR; and (v) a Brfll^I 
fragment from pAXU4, containing the p-globin and sunian virus 40 (Svio) 
po^nytetion sites, the S V40 origin of replication, and the origin of repUcation 
irom pUCS. 

pCTV3 was constructed by inserting a ClaMccl fragment containing a bymv 
mydn^ resistance gene into the Oil site of pCTVl. This gene comists a 
synthetic polyomavinis enhancer and a herpes simplex vinis TK promoter (49) 
attadKd to a hygromydn pbospbofaydrolase gene (3) with a modified transUtion 
uutiatton sequence. 

pCTVlB and PCIV3B were derived by mserthig the 380*p BstYl fragment 

™^ "^^^ P^M8 (44) between the M^Tof 

p<- 1 VI or pCTV3. 

Pi'^T^!^ *^ ^™ ^ rspi jcemeni of the supF gene with the 

Plj)romoter of pBR322 and the neomycin phosphotransferase coding region of 
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Polyoma 




L panidiaium ijolhiocyjMte ^>'»^fl^^ uie poljr(A) RNA by using 

DNA Ugase (Ufc Tedindogia). llie mmure ^ minsform Eschencha 
SerforJO min al 7rC THe >«»'»° ,*AeWedurM described in the 

^ting manual fof the <^^^,.^^^^ it«: uaSonned bacteria were 
ptated in soft agar, mcutettd « lot ^ ^ pooled tac- 

L describaJ P«»^ /'Ot'StSed^Tci^^. di^ with RNase A and 
lerial colonies by *« f*^.,^J^S. acewteand ethand. 
SNase T,. «.d P^'P^'f^.^^SS^SU JD cdl.. Six^enlimctet- 
rTUUla^<itV'>^^'^"ff'^ GP+E46 cells (29) were washed 
diameter culture dishes 'f (DME Stem CeU Technologic. 

™ with Dulbecco'smodiftrf Eagle siMOU^^ Af.2-hydro«yeth- 

^Z^. British aXumbi^ 2SmyS^ 7.2^2«'l'>fI'"= "^''f 
yi^^-^-^^'^^'^J^^;.!^ DEAE^totran (mol««br 
Sning 2 itg ol P'»««™fi*15^'\,^added,andthecelbwefemcubatt^ 
-^^^f»»lI^T^:^:^^^ DME-HEPESconUunmg 
for 60 min at 3rcpe '^^J^ („ DME-HEPES o>n'«""'8 1°*^^ 
10% calf serum and incubated for J a i ^ ,j^gced with 

„nm, and 200 d''^'"~\!^'L^rmeaum was replaced with 4 ml^ 
DME-10% calf senun; 45 b bt">« "J'^^ „ ,„ enhance tram«ip- 
DMtlO%call serum 

Uon from the tranrfected^»nn*(l«;^«^ 

was collected IShlater and HleredandKo^ ,^cm-diameter dish 

or 10 ^gftnl. r^, X a l^tobaUon at 3TC the 

:S^i2'n?4;^S^*S;^H-To*caUse.,un;the>nHrnculturew^ 



^..,^.th.wice^f^«.ofU..mediumfo.3to4«e.to^ 

infected NIH 3T3 ceU cultures and "^'^J^Z^a^ ettanol predpitaUoo. 

Fifty-microUter PCR reacuon '^u'^* Muences (5' primer, CrCACTCX-l 

Mtnealing to ««^^^JS^GQ. 200 ea* *■ 

ISS ?2^^^ffli SiTpolymera«> (Suaogeoe). jnd 

oxynucleoside tnphosphate, '•^.""''TThertnal cycles were as foUows: 1 at 

STbulfer providedjrith ;^P«*^''^3o'1^7«fo.300.;and 30at«^ 
95^{or60s;5at95^for60s,5ffCf« j«s.aD ^ of 13 M 

for 30 s. 5ffC for 30 s. aodj^ fo^ ,Ssa£«aed with chloro- 

aroiMnium acetate was "^"^ ^fStanol Specific PGR producu ma 
fonn and precipitated '"*±''^^^ ZT^a^T^ « P«*f 

portion of the reacnon were "^"S^^ufied DNA was then digested wi* 
S^tectingthecoampUfi^sjjpf^^ 

MIul and B»WI, and PCR Pn»«r.*rr'^ ourified PCR products were li- 
di.n,el„.ion, '^.^'^tSl^^^^S^m^'i^^ of £ 
g»ttd with ftSn^gle bacterial clones. Hasmri 

»rr^"lSHroi"tr«i»»^ 

— ra.:jrSo[VTr— gplasmK. 

RESULTS AND DISCUSSION 

sion cloning (Fig. 1). Tf^PV ' ^ . .pNA inserts during prop- 
as possible to 'n'*^'"'" P'.^fi and as retroviruses. They 
agation both as ijasmi^ ^ a stuffer fragment to 

contain a pair of ^t^^ Bsth adapters for cDNA 

auow thet^ of noncom^^ ,he genomic 

nsertion (45). inseiteo ci^i ~ ^ j^^^ transcript 

viral transcript ^h"^^^ ^ ^.m ^, 
also contains an extended Packa^ng s'p 
titers (2). Thegag in'^aUon codon w Jin^^^^^ ^^^^ 
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NIH 3T3 X CTV3 vims 




TABLE 1. Efficiencies of transfer and expression of 
transforming cDNAs 



^ 9 



8 



NIH 3T3 X CTV3CD24 virus 



1 10 100 1000 

fluorescence intensity 

nc. Z Etetcnnination of cDNA transfer and expression cffidenctes 
GP+E-86 packaging ccUs were transfected with pure pCTV3 or pCTV3CD24 
^ ""^^^ encoding the ceU surface fwotein 
CD2A (25). Nm 3T3 cells were infected with the resulting viral siipemaiants and 
2 days later were stained with an anU<:D24 monoclonal anUbody complexed 
with the fluorochrome phycoeiythrin. Expression of CD24, as measured by 
cST^"** *^ analyzed by flow cytomctiy as described previously 



would prevent cDNA transmission. The pCTV vectors also 
contain SV40 origins of replication so that the libraries can be 
used for expression cloning strategies dependent on episomal 
repUcation in COS cells (24, 45). 

Production of retroviruses. Transient transfection of retro- 
viral packaging cell lines was used to achieve the simultaneous, 
rapid, and proportionate conversion of veiy large numbers of 
cDNA clones into cDNA-bearing retroviruses. With a DEAE- 
dextran-mediated transfection protocol combined with chloro- 
quine and butyrate treatments, titers of about 10^ infectious 
viruses per ml, as determined by hygromycin resistance, were 
obtained with the GP+E-86 packaging line (29). Infection of 
NIH 3T3 cell cultures with the viral supematants produced by 
transient transfection resulted in the transfer of functional 
viruses to about 70% of the cells, as measured by expression of 
a cell surface protein encoded by a virus-borne cDNA (Fig. 2). 
Transient transfection of GP+E-86 cells with pCTV3-based 
cDNA libraries typically resulted in somewhat lower titers, in 
the range of 1 x KT* to 4 x l(r*/nil, possibly reflecting the lower 
punty of the plasmid DNA obtained directly from bacterial 
colonies of the plated library. Such titers were still sufficiently 
high to allow the transfer of 100,000 cDNA clones by using 5 
ml of supernatant for the infection of about 300,000 recipient 
cells. This is in striking contrast to our previous experience 
with direct transfection of NIH 3T3 and other cell lines, in 



Transfected plasmid(s) 



Amt of DNA No. of transformed fod 



pcrvi 

pCTVl + pCTVl/N-Ras 
pCTVl + pCTVl/N-Ras 
pCrVl + pCTVl/PoIyoma 
PCTVI + pCrVl/Polyoma 



1 

1 Jig + 1 ng 
1 iig + 0.1 ng 
1 Jig + 1 ng 
1 »Ag + 0.1 ng 



0 

-300" 
39 

-500* 
62 



• Estimated focus number, diie to high density of foci on dishes. 



which case the use of fully optimized transfection procedures 
(10, 11) and expression vectors (24, 37) resulted in cDNA 
transfer to only 0.1 to 0.5% of the recipient cells. 

Detection of rare oncogenic cDNAs within transferml viral 
populations* One of our initial concerns about using retroviral 
vectors for cDNA Ubrary transfer was the possibility that many 
cDNA clones, especially longer ones, would be inefficiently 
transmitted or partially deleted during transmission. This 
might occur either through interference of the cDNA inserts 
with the transcription or packaging of the viral genomic RNA 
or through recombination between different cDNA clones 
sharing a common packaging cell or vims particle. To compet- 
Uivefy test the fidelity of cDNA transmission, we constructed 
pCTVl vectors carrying two distinctly different oncogenic 
cDNAs, i.e., an activated N-Ras cDNA of only 650 bp and a 
2,850-bp region encompassing the entire early gene region of 
polyomavirus. In addition to its much larger size, the polyoma- 
virus clone included alternatively utilized sp«ce sites and two 
potential polyadenylation signals, features which would be ex- 
pected to be found in some cDNAs and which have the po- 
tential to block formation of the intact genomic RNA needed 
for transmission. The N-Ras and polyomavirus vectors were 
highly diluted with pCTVl vector to mimic a library containing 
rare oncogenic cDNAs. These DNA mixtures were then used 
to transfect GP+E-86 cells, and the resulting supematants 
containing released virus were used to infect NIH 3T3 cells. 
The small N-Ras clone and the large and complex poWomavi- 
ms clone were transmitted to the NIH 3T3 cells with similar 
efficiencies, which were roughly equivalent to their abundances 
within the population of plasmids used for the initial transfec- 
tion of the packaging cells (Table 1). Large numbers of trans^ 
formed foci were induced in the infected NIH 3T3 cell cultures 
even when the abundances of the vectors carrying the N-Ras or 
polyomavirus clones were only 1:10,000 relative to inert vector 
This result suggested that retrovirally transferred cDNA librar- 
les could be used to detect the presence of transforming 
cDNAs with abundances of less than 1:100.000. 

Recovery of retrovirally transferred cDNAs. To facilitate the 
analysis of the large number of transformed cell clones which 
were expected to be generated from cDNA library transfers, a 
PGR strategy was developed for rapidly recovering cDNAs 
from selected recipient cell clones and immediately reinserting 
them into the pCTV3 vector (Fig. 3). Several thousand bacte- 
rial transformants were obtained from cDNA recovery proce- 
dures performed on transformed NIH 3T3 cell clones derived 
from infection with the 10,000:1 mixture of pCTVl with 
pCTVl/N-Ras. The yields of bacterial transformants from re- 
covery procedures performed on the pofyomavirus-trans- 
formed cell clones were about 20-fold lower, in proportion to 
the lower yield of PGR amplification of the longer polyomavi- 
rus cDNA. All recovered cDNA clones which were examined 
had retained the original cDNA stmcture, with the exception 
in all polyomavirus clones of the loss of the 61 -bp intron spe- 
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1 1. PGR with flanking primers 
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2. Cut with Mhj and Bsi 

3. Ligate with A Mlu-Bsi pCTV vector 




Reconstituted 
Retroviral Vector 



^^^^UV,^^^ c'on- and used as ..pla. PGR 

actrntyurrcforcdepcodcnronr^^^ 

£. coi- arc fiiUy functional retroviral vcctors^rtiich can be immediatehT^ for tZ^nTZ ^rf^^IT^^ "^^^ P**^'* "^^^ transformed 

infection of secondary recipients. wnmeaiately used for testing the activity of the recovered cDNA by transfection into packaging celU and 



cific for the mRNA encoding the oncogenic middle T protein. 
This intron would be expected to be spliced out during virai 
transmission. Three recovered N-Ras cDNA clones and seven 
recovered polyomavirus cDNA clones were reconverted to ret- 
roviral form by transient transfection of GP+E-86 cells and 
then used to infect NIH 3T3 cells. All 10 of these clones were 
highly transforming, demonstrating that their oncogenic poten- 
tial had been faithfully maintained through the process of 
transmission, PGR recoveiy, and retransmission. 

Isolation of oncogenic cDNAs from retrovirally transferred 
hbraries. Three separate cDNA libraries derived from hemo- 
poietic cell lines were constructed in the pCTV vectors. An 
estimated 300,000 viral clones were generated from these li- 
branes by transient transfection of GP+E-86 packaging cells 
and transferred to NIH 3T3 cells by infection. CeU cultures 
infected with these libraries developed a total of 83 distinctly 
transformed foci, with a variety of morphologies and growth 
rates, while uninfected cell cultures or those infected with 
viruses derived from the pCTV3 vector alone had 3- to 10- 
fold-lower frequencies of occurrence of transformed foci, 
equivalent to the spontaneous rate of transformation of the 
NIH 3T3 cells used in these experiments. 

The foci of transformed cells which developed in the library- 
mfected cultures were isolated and expanded, and transferred 
cDNAs within them were amplified by PGR and then cloned by 
ligation into the pGTV3K vector. Plasmids containing cDNA 
inserts of the expected sizes were tested for transforming ac- 
tivity by infection of NIH 3T3 cells, and the positive clones 
were sequenced to determine their identities. The mean sizes 
of recovered cDNAs with transforming activity were very sim- 
ilar to the mean sizes of the cDNAs in the originating libraries 
(Table 2). Therefore, there did not appear to be significant 
discnmination against the transmission of larger cDNAs under 
the competitive circumstances of library transfer. 



Table 3 lists the 19 cDNAs with transforming activity which 
were recovered from PCRs performed on 47 of the trans- 
formed reapient cell clones. Twelve of the cDNAs encode 
proteins which have exact sequence identities to entries in the 
sequence databases. Three of these (Raf-1, Ect2, and Lck) are 
known to be able to oncogenically transform NIH 3T3 cells 
Transforming activity of Raf-1 and Ect2 requires N-terminal" 
truncation (28, 34, 47), as had occurred in the cDNAs cloned 
from our retroviral libraries. Lck transforms NIH 3T3 cells 
when Its kinase activity is stimulated by mutations in negative 
regulatory sites (30), but the Lck cDNA which we cloned is 
predicted to encode a protein lacking kinase activity. There- 
fore, Its weak transforming activity may occur through a dif- 
ferent mechanism. The truncated Lck cDNA included three 
out-of-frame ATG codons upstream of the first ATG in the 
Uk-encoding reading frame. When these upstream ATGs 
were removed, which would be predicted to improve the trans- 
lation efficiency of the cDNA (26), both the frequency and the 
potency of transformation were increased (Table 3). This find- 
ing implies that transformation was indeed mediated by the 
predicted translation product. 



TABLE 2. Comparison of cDNA sizes in h'braries versus 
sizes of recovered cDNAs 





Libraiy 




Recovered cDNAs 


Name 


Mean size (bp) of cDNAs 


No. 


Mean sire (bp) 


L16, L17 
L18, L20 
L19 
L25 
U7 


2,200 
1,600 
2,200 
1,800 
2,500 


6 
9 
2 

1 


2,300 
1.700 
1,600 

uoo 

2,400 
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TABLE 3. Transforming cDNAs isolated from rctrovirally transferred cDNA libraries 



cDNA" 

TL16-8CC9 
TL16-«/ATCy 
TL16.11CB1 
TL16-13cB2 
TL16-25C2 
TL17-5CA3 
TL17-llcB5 
TL17-11/ATG^ 
TL18c2 
TLlScZ/ATC?' 
TL18-2cl 
TL18-3cl 
TL18-5cl 
•nL18-6cB4 
TL18-8c6 
TL18.9C1 
TL18"10cAl 
TL19c39 
TL1^10c2 
TL2(WcB8 
TL25-12-M 
TL27-19-1-1 



Sire (bp) 

1.600 
2.800 
1.900 
2.500 
1,800 

800 

2,000 
1,600 
1,700 
UOO 
2,100 
1.800 
2.700 
1300 
1,900 
1.500 
1.200 
2,400 



Identity 

^-Catenin. N truncated 

^-Catenin, ATG enhanced 

No similarity to database sequences 

No similarity to database sequences 

Raf-1, N truncated 

Ect2. N truncated 

Lck. N truncated 

Lck. ATG enhanced 
PhosphoUpasc Oyz, N + C truncated 
Pholipase C-72, ATG enhanced 
Thrombin receptor, full length 
Thrombin receptor, fiill length 
Thrombin receptor, full length 
Thrombin receptor. C truncated 
Thrombin receptor, full length 
CDC24 GNEF family, fuD length 
Low similarity to G3S, antisense 
Thrombin receptor, C truncated 
CDC24 GNEF homolog, N tnmcated 
Spi-2 protease inhibitor, antisense 
No similarity to database sequences 
CDC24 GNEF homolog, C tnmcated 



Transforaiing 
potency* 

Moderate 
Moderate 
Weak 
Moderate 
Strong 
Strong 
Weak 
Moderate 
Moderate 
Moderate 
Strong 
Strong 
Strong 
Strong 
Strong 
Strong 
Weak 
Strong 
Strong 
Moderate 
Strong 
Strong 



0.05 
0.10 
0.01 
0.01 
0.4 
0.5 
0.005 
0.03 
0.01 
0.05 
ND 
0.1 
ND 
ND 
ND 
0.2 
0.02 
ND 
OJ 
0.05 
0.05 
03 



'Ratio of traiBfonned colonies to total ookMiies arising mk^ , . . ^ 

were oounled and scored «i transfiwnKd or nontr^ufon^ after 5 to wX, ^^Z^t^t'^ZT^ ^ "^^NA. Colonies 

adtures mfected will, the pCIV3 vector alone ranged 6om 0.0001 to 0 Oof^Ci^n^Sn^jr„^°H ^ 1»"'»»^<»^^«fonnation ftequende, of NIH 3T3 

'The sequences flanUng presumptive rtan cXs in th^TcDNAfwerillS^^S^Tl^V ^"""""".^^ncy of 0.7. ND. not detennined. 

t^n^^aedst^^toren^veoutlo^^^^^^^^^ 



The other cDNAs represented in the databases were derived 
from genes which were not previously identified as having 
transforming activity, and therefore their mechanisms of trans- 
formation are speculative. Thrombin receptor expression and 
Its cleavage by thrombin in the serum used for cell culture 
presumably activates a G-protein<oupled mitogenic sienalline 
pathway (27, 48) in NIH 3T3 cells, as has been demoStrated 
tor Ugand-stimulated serotonin and muscarinic receptors (18 
23). The transforming effects of the truncated ^-catenin may 
be due to its disruption of the function of P-catenin-associated 
cell adhesion molecules such as E-cadherin (19). leading to 
transformation via loss of contact-induced growth repression 
Alternatively, expression of truncated p-catenin may itself trig- 
ger a mitogenic signal transduction pathway in NIH 3T3 cells 
The mechanism of action of the truncated phospholipase C-v, 
IS even more obscure, as it does not encode the catalytic do- 
mains required for mitogenic signal transduction (14). It could 
conceivably act by interfering with the function of an unknown 
inhibitor of the endogenous phospholipase C-7, of NIH 3T3 
cells. As seen for the Lck cDNA, the transforming activities of 
the truncated p-catenin and phospholipase C-72 cDNAs were 
increased (Table 3) when upstream ATGs were removed and 
the sequence contexts of the presumptive initiation codons 
were optimized for translation (26). 

The transforming activity of the antisense Spi-2 cDNA was 
presumably due to an inhibitory effect on the stability or trans- 
ation of mRNAs encoding the Spi-2 protease inhibitor (20) 
leading to increased secreted protease activity and disruption 
of cell-cell or cell-malrix contacts which normally repress pro- 



The remaining seven of the transforming cDNAs do not 
have high sequence identity with any entries in the DNA se- 
quence databases. Three of these cDNAs encode proteins 
which include domains with homology to the CDC24 family of 
guanine nucleotide exchange factors (6) and therefore are pre- 
sumed to transform NIH 3T3 cells by constitutively activating 
signal ti^uction through Ras-Iike proteins. One of the other 
novel clones has an antisense-oriented reading frame with 
weak but significant sequence similarity to beta subunits of G 
proteins and thus potentially acts as a suppressor of an endo- 
genous G-protein-coupled signal transduction pathway which 
inhibits transformation of NIH 3T3 cells. 

Of the 14 different types of cDNAs which were isolated in 
our screening of the retroviral cDNA libraries. 8 encoded pro- 
teins which are known or strongly implicated as positive induc- 
ers of mitogenic signals, and one (TL18-10cAl) may be an 
antisense suppressor of an inhibitor of mitogenic signals. This 
high yield of known or presumptive growth regulatore confirms 
the vahdity of using the NIH 3T3 focus formation assay for 
detecting cDNAs encoding proteins with these functions The 
fact that four of these cDNAs had not previously been cloned 
confirms our expectation that many novel growth regulators 
remain to be cloned via screening for transformation of NIH 
3T3 cells or other transformation-susceptible cell lines. Given 
t^e proportion of growth regulators isolated in our screen 
there is a high probability that the three novel cDNAs with no 
obvious coding sequence similarity to known proteins will also 
prove to be bona fide growth regulators. Alternatively, one or 
more of these cDNAs may encode or repress the expression of 
proteins which morphologically transform NIH 3T3 cells by 
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altering cell surface interactions (possible examples being the 
api-2 and 3-catenin clones). 

Some of the cDNAs described above appear to have ac- 
quired tramrforming activity via mutations arising in the pro- 
cess of cDNA library construction, i.e.. truncation or inversion. 
The oqprrasion of such altered cDNAs can therefore be critical 
to the induction of a desired phenotype. either through the 
removal of negative regulatoiy domains from a phenotype 
inducer (e.g.. truncated Raf-1 and Ect2) or by the generation 
of suppr^rs of pheno^ repressors (e.g.. antisense Spi-2). 
The possible dependence on specific mutation for the revela- 
tion of a phenotype such as oncogenic transformation means 
that only a small subset of the cDNAs derived from transcripts 
of a given gene may be functionally detectable. Furthermore 
many of the cDNAs cloned in this study had transformation 
frequencies considerably less than 1. While this may in part be 
due to the transmission of a partially deleted virus or to pro- 
moter mteiference resulting in the inactivation of the LTR in 
most cases it is Ukely due to critical thresholds of expression 
"^x,^"JS.'"^^"* *^ °' Proviruscs can^ig the 
^^S^rP"?* ^° and screen ven- large 

of sudi cm5!^*^° ^ requirement for the cloning 

The Sm 3T3 cell line is obviously a very effective recipient 
cell for detecting and cloning oncogenes. This is largely be- 
rause of Its high susceptibility to transformation by the emres- 
sion of proteins which activate the Ras-Raf signal fransduction 
patiiw^. When expressed in the C3H10n^ fibroblast line 
only the truncated Raf cDNA and the TL19-10c2 cDNA 
caused fransfonnation, while all otfier cDNAs listed in Table 3 
were negative. However, we have recentiy used tiie C127 epi- 
thelial cell line as a recipient for cDNA library transfer and 
have isdated one cDNA encoding a putative franscription 
.transforms this line but does not fransform NIH 
3T3 celb. This implies that the use of alternative recipient celb 
can either resfrict or expand the specfrum of fransfonning 
cDNAs which can be detected and cloned 

The 19 fransfonning cDNAs listed in Table 3 were isolated 
from a relatively small number of fransferred clones, reore- 
sentmgjust 7% of the 4 million clones in the available libraries. 
The scale of our library screens has been restricted not by the 
number of cDNA clones which can be ti^nsferred but rather by 
tiie number oftransforming cDNAs which can be processed 
and analyzed. There are presumably many more transforming 
cDNAs remammg to be detected in these libraries. Of Uie 14 
different types of fransfonning cDNAs isolated, multiple inde- 
pendent clones were obtained only in the case of the thrombin 
f^fn n ^f'a'i^ely high abundance of its mRNA 

n 32p cells (25a) and the apparent ability of this receptor to 
transform NIH 3T3 cells without a requirement for specific 
truncating mutations. 

Two naajor advantages were obtained through the use of 
retroviral libraiy transfer for the isolation of cDNAs capable of 
inducing proliferation of NIH 3T3 ceils. The first was the 
ability to screen very large numbers of cDNA clones on an 
equivalent number of recipient cells. This resulted in a high 
Z f "F^ ""^"^ ^""^ spontaneous transfonnation 
^'^ "'^ °^ '^*=°^«'y °f transforming versus 
iner^ cDNAs. The second advantage was the relatively high 
nM* e^^^'on obtained with retrovirally tranrferred 
cDNAs. which m our hands are 10-fold greater on average than 
those obtained with the best available stable transfection vec- 
tors. Efficient Mpression can be critical for obtaining transfor- 
mation and other targeted phenotypes in recipient cells, par- 
ticularly when an antisense mechanism is involved 
A third major advantage of retroviral cDNA library transfer 



not exploited in this study, is the potential to use cell lines 
which have previously been completely inaccessible to expres- 
sion cloning because of low transfection efficiencies. There are 
onty a Imuted number of fibroblast and epithelial cell lines 

nii™ lin iiL "'n'rast. almost all ceD lines and some pri- 
mary ceU types can be mfected with retroviruses. We have 
Sehcrating retroviral cDNA libiiries wiU, die 
BOSC23 packaging ceU Ime, which was established for die 
f^^K^ of yielding high viral titens via fransient frani^ 
fection (41). The viral Uters of up to 10«/ml obtained widi th^ 
packaging cells are sufficiently high to enable die fransferl* 
million-member cDNA libraries to hemopoietic cell lines and 

?n J!i'*"°u °^ '^^^ conferring selectable phe- 
notyp« such as conversion to growth in the absence of «to- 
kines (50 Use of the BOSC23 cell line should pennit th^ 

ti^ansfer of large CDNA libraries to cell lines represeSti^mS 
different hneages as well as to some primary cell tvoJ Thk 

would make retroviral fransfer of cDNriibSiroqS 

suitaWc for die cloning of cDNAs which can impose nXl 

no^pes or complement genetic defects in specialized cell 
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ABSTRACT RNA and ribonuclease-resistant RNA ana- 
logs that bound and neutralized Rons sarcoma vims (RSV) 
were isolated from a large pool of random sequences by 
multiple cycles of in vUro selection using infectious viral 
particles. The selected RNA pool of RSV-binding sequences at 
a concentration of 0.16 $M completely neutralized the vims. 
Of 19 sequences cloned from the selected pool, 5 hihibtted RSV 
infection. The selected RNA and RNA analogs were shown to 
neutralize RSV by interacting with the virus, rather than by 
adversely affecting the host cells. The selection of the anU-RSV 
RNA and RNA analogs by intact virions immediately suggests 
the potential application of this approach to develop RNA and 
RNA analogs as inhibitors of other viruses such as human 
immunodeficiency virus. 



In comparison to the large numbers of antimicrobial agents 
against bacteria, few effective antiviral drugs have been de- 
veloped (1, 2). The most widely used approach to the discovery 
of antiviral agents has been the empirical screening of chem- 
ically diverse classes of synthetic and natural compounds for 
the ability to inhibit vims replication (2, 3). However, the 
empirical screening is usually tedious and inefficient. Other 
approaches such as rational drug design based on the known 
structures of viral and cellular proteins are promising but to 
date have not been largely successful (3). 

RNA molecules have a remarkable diversity of stnicture and 
function (4, 5). The structural complexity of a combinatorial 
sequence library of single-stranded RNA oligonucleotides 
provides the potential to select for molecules that bind defined 
targets with high affinity and specificity (6-8). Because the 
selected RNA molecules can be amplified and subjected to 
further selection, multiple cycles of such selection and ampli- 
fication can lead to the isolation of the best target-binding 
RNA molecules from the library. This in vitro selection- 
amplification procedure has been previously used to isolate 
RNA (and DNA) molecules that bind a variety of small 
molecules and purified proteins (9-12). We have extended the 
procedure to a new type of target, an intact virus, to isolate the 
RNA molecules and the nuclease-resistant RNA analogs that 
specifically bind and neutralize the virus. 

Rous sarcoma virus (RSV), an avian retrovirus, is among 
the best-studied members of the family Retroviridae, which 
includes human immunodeficiency virus (HIV). All members 
of this family arc enveloped Viruses that display on their 
surface glycoproteins which are required for binding to a 
specific receptor on a susceptible cell and for fusion with the 
cellular membrane (13. 14). Here we use RSV to demonstrate 
that without knowledge of the structures of viral proteins, 
antiviral RNAs and RNA analogs can be isolated systemati- 
cally and effectively from a large pool of random sequences. 



The publicatton cosu of this article were defrayed in part by page charge 
payment. This article must therefore be hereby marked "advertisement** in 
accordance with 18 VS.C $1734 solely to indicate this fact 



first by using intact viral particles to select from the sequence 
pool the RNAs and RNA analogs that bind specifically to the 
virus, and subsequently by screening the selected molecules for 
the ability to neutralize the virus. We reason that by binding 
to virus, some of the selected RNAs and RNA analogs may 
change the structures of viral surface proteins so that these 
proteins can no longer function in steps oitical for viral 
infection, such as viral attachment and virus-cell membrane 
fusion. Alternatively, some of the structural changes may 
trigger pathways to inhibit the steps whidi normally occur after 
virus internalization, such as the uncoating and the expression 
of the virus genome. Both mechanisms-have been previously 
suggested for many similar virus neutralization reactions in- 
duced by the binding of antibodies (15, 16). 

MATERIALS AND METHODS 

Materials. Nucleoside 5 '-triphosphates were ptu-chased 
from United States Biochemical. T7 RNA. polymerase was 
purified from the overproducing Escherichia coli strain BL21 / 
pAR1219 (17). Superscript 11 reverse transcriptase was from 
GIBCO/BRL. Tag DNA polymerase was from Promega. 
Human plasma was provided by David Fanell (Pennsylvania 
State University). Prague A strain of RSV and anti-RSV serum 
were prepared as described (18-20). 

Construction of a Large RNA Pool of Random Sequences. 
A DNA library containing -5 X 10" sequences (2.5 mg of 
DNA) was constructed by automated solid-state synthesis. The 
sequence diversity was generated by randomizing a central 
40-nt region (40N) of the 87-nt oligomer 5'-GCC-GGA-TCC- 
GGG-CCT-CAT-GTC-GAA-40N-TTG-AGC-GTT-TAT- 
TCT-GAG-CTC-CC. The 5'-end primer (5'-GCC-GGA. 
TCC-GGG-CCr-CAT-GTC-GAA-3') and 3'-end primer (5'- 
CCG-AAG-CTT-AAT-ACG-ACT-CAC-TAT-AGG-GAG- 
CTC-AGA-ATA-AAC-GCr-CAA-3'. containing a T7 RNA 
polymerase promoter) were made for the polymerase chain 
reaction (PGR) to amplify the 87-mer. Large-scale PGR of the 
DNA library resulted in double-stranded DNA (-5 x 10" 
sequences) which was transcribed by T7 RNA polymerase to 
generate a pool of multiple copi^ of <*10" RNA sequences 
(random pool). 

Transcription of RNAs and 2'-Fluoropyrimidine-Contain- 
ing RNAs (2'-F-RNAs). Unmodified RNAs were transcribed in 
vitro by T7 RNA polymerase at 3T*C for 2-3 hr in a buffer 
containing 1 mM each ATP, CTP, GTP, and UTP; 60 mM 
Tris-HCI (pH 8.0). 12 mM MgQz. 1 mM spermidine, 5 mM 
dithiothreitol. and 0.001% Triton X-100. 2'-F-RNAs vrerc 
transcribed at SS'C for 8 hr in the same buffer except that 1 



Abbreviations: RSV. Rous sai ;cma virus; HIV, human immunodefi- 
ciency virus; 2'-F-RNA, 2'-fluoropyrimidinc-containing RNA. 
♦Present address: Residency Program, Department of Medicine, Poly- 
clinic Medical Center. Harrisburg, PA 17110, 
»To whom reprint requests should be addressed. 
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mM 2'-F-CrP and 2'-F-UTP replaced CTP and UTP. The 
fiill-lcngth transcripts were purified by polyacrylamide gel 
electrophoresis. 

In Vitro RNA Selection by RSV Particles. The RNA (or 
2'-F-RNA) sequence pool was incubated with RSV (Prague A 
strain) for 10 min at 3rC in 2,5 mM MgC^lOO mM NaCl/20 
mM Tris-HCl, pH 7.5. The mixture was then filtered through 
a prewetted nitrocellulose membrane. The RNA-virus com- 
plexes were recovered from the nitrocellulose membrane after 
the unbound RNAs were washed away. The RNA molecules 
which bound anything else but RSV (e.g.. the nitroceUulose 
membrane, the cellular proteins contaminating the virus so- 
lution) were removed from the RNA pool by a background 
selection either prior to or after the selection by virus. The 
RNA-RS V complexes were denatured and Viral proteins were 
removed by phenol/chloroform extraction. The RNAs were 
then recovered by ethanol precipitation and reverse- 
transcribed to their cDNAs by Superscript II reverse tran- 
scriptase. The cDNAs were PCR-amplified and subsequently 
transcribed to give a new pool of RNA which was enriched for 
better RSV-binding sequences. This new pool of RNA was 
used for selection by intact RSV to begin the next cycle. The 
selection stringency was increased by lowering the RSV con- 
centration as the number of selection cycles increased. The 
RNAH^irus complexes were also separated from the unbound 
RNA by high-speed ccntrifugation rather than filtering during 
cycles 4 and 9; the RNA-^irus complexes formed a pellet along 
with any unbound virus while the unbound RNA remained in 
the supernatant. After 12 cycles, the selected RNA pool was 
tested for the ability to bind and neutralize RSV. 

Binding of the Selected RNAs to RSV, The ^zp-labeled RNA 
pool (or a cloned RNA sequence) obtained after 12 cycles of 
selection was incubated at 0.2-0.5 nM with various concen- 
traUons of RSV for 15 min at 3TC in 2.5 mM MgClj/lOO mM 
NaCI/20 mM Tris HCI, pH 7.5. The RNA-virus mixtures were 
filtered on prewetted nitrocellulose membranes; >95 of the 
RNA-RSV complexes were retained on the membrane, and 
the unbound RNAs were washed away by. the binding buffer. 
The radioactivity retained on the membranes was measured by 
liquid scintillation counting and corrected for any background 
due to the binding of RNAs to nitrocellulose membrane. 

Vims-Neutralizing Activity of the Selected RNAs. Fresh 
RSV suspension (Prague A strain, •-lO' focus-forming units/ 
ml) was supplemented with 2J mM MgQz, mixed with the 
selected RNA pool or a specific RNA sequence, and incubated 
at 3TC for 15 min. The RNA-virus mixture was placed onto 
10* quail fibrosarcoma cells (QT6 cell line), incubated for 1 hr 
at 3T*C in a 5% CO2 atmosphere, and then replaced with 
complete growth medium (19). At 18 hr postinfection, the cells 
were labeled with L-pS]methionine for 2 hr. Viral proteins in 
cell lysates and media were recovered by immunoprecipitation 



with an anU-RSV serum and analyzed on 12% polyacrvl- 
amide/0.1% SDS gels (19, 20). The yields of viral proteins on 
the gels were estimated by densitometry. 

RESULTS 

In VUro Selection of RSV-Bindlng RNA Sequences. An RNA 
pool ot j»\Q^ different sequences, each 87 nt long, was 
subjected to selection by sucrose-gradient purified RSV par- 
ticles to isolate the RNA molecules that bound the virus. The 
RSV-bmdmg RNA sequences were then amplified and sub- 
jected to the next cycle of selection at a lower RSV concen- 
tration to mcrease the selection stringency (Fig. 1). Multiple 
cycles of selection and amplification resulted in an exponential 
mcrease of tiie best RSV-binding RNA molecules. After 12 
cycles, the RSV concentration required to bind 50% of '-0.5 
nM selected RNA pool (dissociation constant, Kd decreased 
to fig of viral protein per ml, which was at least 1000 times 
lower tiian tiiat required to bind 50% of tiie pool of random 
sequences (Fig. 2). The RSV concentration was expressed in 
terms of total viral protein concentration because the molar 
number of viral particles could not be accurately measured 
Sequencing of the selected RNA pool revealed that Uie RNA 
sequences were no longer random and particular sequences 
appeared to dominate Uie population. The selected RNA pool 
was then tested for its ability to neutralize RSV 

Neutralization of RSV by the Selected RNA Pool. In tiie 
absence of any added RNA, avian cells (QT6) were efficiently 
mfected by RSV as shown by tiie large amount of newly 
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Fig. 2. Binding of 0.2-0.5 nM ^^P-Iabclcd RNA to RSV a. 
Selected RNA pool obtained after 12 selcction-amplificalion cycles- 
O, random RNA pool. Because the detection limit for the ^zp-Ubcled 
RNA was about 0.05 nM and the affinity between the selected RNA 
pool and RSV was high, the measured percentage of the bound 
selected RNA at low concenUations of RSV may be limited by the 
available binding sites on the virus rather than by the dissociation 
constant (K,). Thus, the of 3 /ig of viral protein per ml derived from 
such measurements may be an estimate of its upper limit 
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Fig. 3. Effect of the selected RNA pool on RS V infection of QT6 
quail cells and on host cell protein synthesis. {A) InhibiUon of RSV 
production. Viral proteins in cell lysatcs and media were recovered by 
immunopredpitaUon with an anti-RSV serum and analyzed on 12% 
polyacrylamide/0.1% SDS gels. Pr76«»«, fiiU-length Gag protein; 
PrlSOWPo*, Gag-Pol fusion protein; CA (capsid proteins) and PR 
(protease), mature cleavage products derived from Pr76w. The RNA 
concentratipn m each virus sample is shown above each lane. However, 
the effective RNA concentration in each sample should be lower than 
the concentration shown here and in Fig. AA, because up to 70% of the 
iniUally added RNA was found to be degraded within 75 min by 
nucleases m the RSV suspension Unlike the sucrose gradient-purified 
RSV used for the selection, the RSV suspension used for the infection 
^ay was freshly prepared and not purified, and thereby was more 
likely to be contaminated with ribonudeascs from cell culture. (B) 
Effect of the selected RNA pool on the host ccU protein synthesis. QT6 
cells were treated in the absence of RSV with (lane 1) and without 
(lane 2) 500 nM selected RNA pool under the same conditions as used 
in A. After radiolabeling. total cellular proteins were analyzed by 
electrophoresis and autoradiography. 

synthesized viral Gag (Pr76*«) and Gag-Pol (PrISOR"*-***) 
proteins in cell lysates, and capsid proteins (CA) and viral 
protease (PR) of the Gag cleavage products in growth media 
(Fig. 3/1. lane 5) (19, 20). The presence of random-sequence 
RNA molecules at 330 nM had no effect, within experimental 
fluctuation, on the production of either the viral proteins or 
the background cellular proteins such as the 29-kDa band (Fig. 
14. lanes 3 and 4). In contrast, the selected RNA pool at 20 nM 
reduced the yield of viral proteins by 85-92% (Fig. 2A, lane 2. 
estimated by densitometry). At 160 nM. the selected RNA pooi 
completely blocked RSV production in this assay (Fig. 14, lane 
1). The selected RNA pool did not affect the yield of the 
cellular proteins to a detectable extent (Fig. 14, compare the 
background bands in lanes 1 and 2 with those in lane 5). Thus 
the RNA pool of selected sequences appeared to block RSV 
production by neutralizing the virus, rather than by adversely 
affecting the host cells. Such specific inhibition of RSV infec- 
tion by the selected RNA pool and the cloned individual RNA 
sequences described below was reproduced in five and four 
independent sets of experiments, respectively. 

The effect of the selected RNA pool on the host cell's 
growth and susceptibility to viral infection was further tested. 
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Fic. 4. (4) Inhibition of RSV by indhridual RNA sequences. The 
infecttons were performed in the same way as for the selected RNA 
pool (Fig. 3). In this particular experiment, viral protein Pr76W 
ove rlapped with some of the background cellular proteins (comparison 
with lane 6). For lane 7. the QTS cells were treated with the selected 
RNA pool (500 nM) for 1 hr and washed with medium prior to 
infection by RSV. {B) Nucleotide sequences of the RNAs that 

2T2cl"*"?^P"« ^"^'^ ^ ^» Sequence A bound 

Dotii RSV and mtroceUulose membrane and was the most abundant 
sequence m the selected pool but did not inhibit RSV. Each sequence 
consists of a 5 ' fixed region, the variable 40 nt (40N) regwn (umcrcasc 
letters), and a 3' fixed region. Bold letters show the groups of guanine 
nucleotides that appear at similar positions in at least two sequences. 
The underlined regions are the minimum sequences required for 
speafic binding to RSV, as determined by the partial alkaline hydro- 
lysis method (21). ^ 

Exposure of uninfected QT6 cells to the selected RNA pool at 
500 nM did not affect cellular protein synthesis (Fig. 3B). 
Furthermore, pretreatment of cells with the selected RNA 
pool at 500 nM for 1 hr prior to infection by RSV did not 
change the yield of viral proteins (Fig. 4>4, compare lane 7 with 
lane 5), indicating that the selected RNA pool had no effect on 
the susceptibility of the cells to RSV infection. In addition, the 
selected RNA pool showed no higher affinity for the host cells 
than did the random RNA pool (unpublished data), indicating 
no specific binding to the cells. AU these results were consistent 
with the conclusion that the antiviral activity of the selected 
RNAs was due to their specific interactions with RSV. 

Specific RNA Sequences That Neutralize RSV. To identify 
specific RSV-neutralizing sequences in the selected RNA pool 
we cloned the DNA templates for the selected RNA pool into 
pUC19 vectors and sequenced 36 clones. The most prevalent 
sequence (sequence A) appeared in 9 clones, followed by 
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sequences B. C D. and which were found in 4, 4, 3, and 2 
clones, respectively. Each of the other 14 clones had a unique 
sequence. Among the 19 sequences identified from the 36 
clones, RNA transcripts of 5 sequences (B, E. F, G, and H) 
showed RSV-neutralizing activity (Fig. 4). Because in the 
particular experiment shown in Fig, 4A viral protein Pr76«»« 
overlapped with some of the background cellular proteins, 
which vary from one experiment to another (lane 6), the 
comparison of RSV production between different lanes may be 
better judged by the yields of Prl803»«-i»* (in the lysates) and 
capsid proteins and protease (in the media). It was clear that 
sequences E, F, and G at 40 nM inhibited RSV as effectively 
as the selected RNA pool at 80 nM (Fig. 4A, compare lanes 3 
8, and 9 with lane 1). Sidc-by-sidc experiments showed that 
sequences B and H were 2- to 3.foId as effectwc as sequences 
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H««H^ ^1 ^'^^"''y 2'-F-RNA of sequence H against 

degradation by nucleases in human plasma The RNA and the RNA 

M^^r?^^T^*^^ P'^*"* supplemented with 2J mM 

MgQz at 3TC for the times shown. Lane 5 shows the Mg2*-enhanccd 
background hydrolysis in 48 hr. The half-life of the RNA analog in 
human pUsma (22 ± 4 hr) was about the same as those of the analog 
pools of the selected and random sequences. (B) Inhibition of RSV by 
?u f ^r"** sequences- The analog pool wZ 
transmbed from the pool of sequences that was generated^ter 12 
cycles of selection with an unmodified RNA pool. 
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E, F. and G in neutralizing the virus (unpublished data) 
Sequence A bound to RSV with an affinity comparable to that 
of sequences B and H(Ks^2^3 m8 of viral prot^per m?) bu 
had at least 4 times higher affinity for nitrocellulose mem' 
branc. which might explain why it was the most frequent 
sequence among the clones. Comparison of the cl ned se- 
quenas revealed that groups f guanine nucleotides appeared 
at smiilar positions in aU the RSV-neutralizing RNA sequences 
(Fig. 45), but no consensus could be discovered amonfl their 
secondary structure (base-pairing) diagrams that wercgcner- 
atcd by energy minimization (22). 

Neutralization of RSV ^'-F-RNA Analogs. One of the 
potenual problems in developing RNA as a therapeutic agent 
IS the rapid degradation of unmodified RNA by ribonudcascs 
present in vivo. This problem may be so^cd by a number of 
fu^ V^"""^.^ incorporating modified nucleotides into 
the RNA chain (23, 24). Indeed, the incorporation of 2'. 

fi^'SMA'^'^^^uyiJ^'"'' 2'.fluoro-2'-deoxyuridine into 
tne RNA Cham by T7 transcription increased the RNA stability 
agamst the nuclease digestion by more than 2 orders of 
magnitude (Fig. 5A). A similar increase in RNA stability was 
observed when either the RNA pool of random sequences or 
the selected RNA pool or sequence H was tested (unpublished 
data) mdicating httle dependence on RNA sequence (pic- 
sumably each RNA chain contained about the same number of 
pynmidines). 

A pool of 2'-F-RNAs of the same selected sequences as used 

lpl ilK?^"*'''^>I^^^.4^*^°" ^ "^^^ (comparison of 
Une 1 with lanes 3 4 and 5 of Fig. SB by densitometry) and 
-75% (unpubhshed data) when used at 140 nM and 1(X) nM 
respectively. In contrast, the level of 85% inhibition wa^ 
achieved by only 20 nM unmodified RNA pool (Fig 3A lane 
2), indicating that the analog pool was 7-10 times less eff^ve 

a 20) nM had no effect on RSV infection (Fig. 55. lan^ 3 and 
4). Neither selected nor random sequences in the analoa Dool 
at up to 200 nM affected host cell protein synthesis (Fig. 5B 
lan^ 1 and 4). Because each of the 12 cycles of selection was 
performed with unmodified RNAs, it was likely that the lower 
anti-RSV activity of the RNA analogs was due to the RNA 
structural changes induced by the 2'-fluoro modification in 
pynmidmes. Nevertheless, the neutralizing activity of the 
analogs at high concentrations implied that some structural 
features of the unmodified RNA important for antiviral ac- 
tivity remained. 
To test whether selection of RNA analogs would lead to a 

II^i^i^ir^Ji!?f ""^^^'^^^ act^ity, we transcribed a 
pool of 2 -F-RNAs from the sequences that were generated 
after 9 Qrcles of selection by RSV from an unmodified RNA 
pool and then continued the selection with the analog pool for 
another 3 cycles. The resulting pool of 2 '-F-RNAs inhibited 
RSV production by 75% when used at 22 nM (unpubhshed 
data), whereas the same level of inhibition was achieved at 100 
nM for the analog pool transcribed after 12 cycles of selection 
oma"T°^**?^'* P^'- '^"s* selection with 

: c r . 1^ ^ ^^'^ improved the anti-RSV acth^ity by 

4.5.fold. An even more potent anti-RSV analog pool would 
have likely been obtained if all the 12 cycles of selection had 
been done with analog pools. Nevertheless, it is clear that 
nbonuclease-resistant antiviral RNA analogs can be obtained 
either by selecting from an unmodified RNA pool and then 
transcribing the selected sequences with modified nucleotides 
or by direct selection from an RNA analog pool. 

DISCUSSION 

Because of the permeation problem imposed by the RSV 
membrane, the selected RNAs are expected to inhibit RSV 
infection by binding to the virus surface and not by enterine the 
virus. However, the identification of the RNA binding sit^ on 
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RSV and f the roles of these viral components in the virus 
infection has not yet been completed. 

Selection f6r RNAs InhibltlQs Moltiple Strains of Vims. 
Although the selections were pwf rmcd with the Prague A 
strain of RSV, preliminary results indicated that the selected 
RNAs also inhibited the Prague C strain of RSV. The RNA 
concentration required t inhibit the Prague C strain was about 
15-20 times higher than that needed to achieve the same level 
of inhibition of the Prague A strain (unpublished data). The 
surface glycoprotein (gp85) of these two strains contains 
conserved regions (-95% homology) as well as three or four 
smaU variable regions (subgroupKletcrmining regions) (25. 
26). If the selected RNAs and analogs neutralize RSV by 
mteracting with the surface protein, it is likely that a small 
number (e.g., -5%) of the Prague A-neutraliring sequences in 
the selected pool bind to the conserved regions of m85 and 
thus also neutralize the Prague C strain. 

The selection may also be performed by alternating between 
several strains of a vkus during the different cycles of selection 
to specifically isolate Uie RNAs and RNA analogs that would 
bmd to tiie viral conserved structural motife and tiiereby 
mhibit diverse strains of the vims (S. Dallabrida, J. C Sanders 
E. M. Eyster, and J.-RW., unpublished data). The isolation of 
RNAs and RNA analogs tiiat are effective against dwerse viral 
strains will be particularly necessary for inhibiting viruses such 
as mv, which has a high mutation rate and readily develops 
drug-resistant strains. Such selection is possible and practical 
because a very large structurally dwerse library is used and Uie 
selection is a rapid process. 

Selection by Intact Blological Entities. Altiiough an isolated 
vu^ protem can be used in tht in vitro selection for tiie 
protem-bmding RNAs and some of tiie selected sequences may 
inhibit viral infection, tiicrc arc at least tiiree advantages to tiie 
use of mtact viral particles in tiie selection, (i) The selection by 
mtact virus does not require a fiiU understanding of tiie usually 
very complex mechanism of viral infection, whereas tiie se- 
lection by an isolated viral protein is limited to only a few cases 
m which tiie proteins responsible for tiie viral infection have 
been identified and isolated. («) Because of structural differ- 
ences m tiie protein (27, 28), tiie RNA selected to bind a viral 
protein m its purified form may not interact witii tiie protein 
complcxed on a virion, (m) TTic selection using intact vims may 
lead to tiic identification of viral components tiiat have not 
previously been known for tiieir critical roles in viral infection 
The success of isolating numerous anti-RSV RNAs and RNA 
analogs from a large pool of random sequences immediately 
^ffiests tiie potential appUcation of tiiis approach to isolate 
RNA and nudease-resistant RNA analogs agaiijst otiier vi- 
ruses sudi as HIV. ^ S-msi oincr VI 

The sel^on of RNAs (or DNAs) and tiieir analogs may 
also be performed by otiier biological entities such as bacteria, 
yeast, and tumor cells (S.G., W.P., and J.-F.W., unpublished 
data). The selected nucleic acids and their analogs would bind 
to these cells and may change tiieir biological functions by 
affcctmg cellular mteractions such as signal transduction path- 
ways. Therefore, tiie selection of RNAs and RNA analogs by 
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intact viruses or ceils may provide valuable tools for studyuw 
virus-cell and intercelhtiar interacti ns. The selected RNAs 
and analogs may be used to control such interacti ns so tiiat 
the mfecti n by virus, bacterium, r yeast or tiie proliferation 
or death of cells may be inhibited. 

WetiiankJ.E.Hoppcr,D.Spector,M.Katzman,andO.Uhlcnbeck 
for discussions and A K Hopper, R. Shiman, and L Parent for 
comments on the manuscript This work was supported by Pcnnsyl- 
j^iaState University Start-up Fiinds (J.-F.W.), National luttitutesof 
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Summary 

A mutant of Moloney murine leukemia virus (M- 
MuLV)> pMOV-^~, was constructed by deletion of 
at>out 350 nucleotides from an infectious proviral 
DNA clone between the putative env mRNA 5' splice 
site and the AUG that initiates the coding sequence 
for Pr65^. Although the parent wild-type proviral 
clone, pMOV-^% quickly causes a high level of re- 
verse-transcriptase-containing virus particles to be 
released from transfected NIH/3T3 cells, transfec- 
tion of pMOV-^~ into these cells initially results in 
very littie release. By 9 to 10 days after transfection, 
however, pMOV-^~-transfected cells produce infec- 
tious virus. Thus pMOV-^~ has a defect that can be 
repaired in transfected NIH/3T3 cells, presumat>ly 
by recombination with a sequence normally present 
in the celts. Cell lines with pMOV-f ~ stably inte- 
grated into chromosomal DNA produce reverse-tran- 
scriptase-containing particles that lack detectable 
M-MuLV RNA but the cells effidentfy complement 
replication-defecthre, packagable retroviruses. Thus 
pMOV-^~ has a defect in the packaging of genomic 
RNA into virions but can provkie in trans the prod- 
ucts necessary for virion production. The deletion in 
pMOV-^~ appears to define a site required in c/s for 
packaging of MuLV RNA into virions. Cell lines car- 
rying pMOV-^~ can be used to produce helper-free 
stocks of natural or synthetic defective retroviruses. 

Introduction 

Reuovirus RNA is packaged into virions as a 70S dimer of 
two identk:al, capped and polyadenyfated 35S RNAs (for 
review see Varmus, 1982). How the virus achieves selec- 
tive packaging of these RNAs and excludes other mRNAs 
remains one of the more obscure aspects of the retrovffus 
life cyde. One possd)le explanatk)n is that there is a site 
on the viral gerxyne wtiich interacts with a vmon protein to 
direct spedfk:aDy the packaging of the RNA. If such a site 
were deleted from a viral genome, the resdt wodd be a 
cis defect preyentvig the erx^apsklation of Qenomac RNA. 
Such a mutant, however, shodd stSl be capat)le of (Erect- 
ing the synthesis of afl viral protetf^. . 

In searching for a cis packaging signal we noted that 
retroviruses oackaoe full-length RNA txit not the spiced 



al.. .1978; Levin ar)d Seidrnan. 1979; Stoltzfus and Kuhnert. 
1979). The ability of the virus to differentiate between these 
RNAs suggested that an essential part of a packaging 
signal might be in the regk)n spliced out to form the env 
mRNA. If such a site was outside the coding region for 
Pr65^ and reverse transcriptase, it woukJ have to be 3' 
of the presumed splice donor site and 5' of the AUG that 
initiates translation of Pr65^. 

A second clue to the locatk>n of a packaging site came 
from the study of a Rous sarcoma virus-transformed quail 
cell line that is deficient ffi packaging viral RNA (Unial et 
al.. 1978). This mutant has a 150 t)p deletion somewhere 
t)etween 300 and 600 t>p from the left end of the provirus 
thereby implicating once again the regkxi upstream from 
the start of Pr65^ as important for packaging (Shank and 
Unial. 1980). A complk:atir)g property of this mutant rs that 
it is frar7s-dominant: wild-type genomes are also tnefft- 
cientty packaged upon superinfection, suggesting that a 
second mutation exists in some ''packaging factor" which 
relaxes the normal specificity for Rous sarcoma virus RNA 
(Linial et al.. 1978; Unial, 1981). 

In this study, we describe the deletion from Moloney 
murine leukemia viais (M-MuLV) of approximately 350 bp 
between the left tong terrninal repeat (LTR) ar»d the start 
codon for Pr65^. The resulting mutant has a c/s-active 
deficiency for packaging of gerx>mk: RNA arxi can be 
used to generate pure stocks of defective retroviaises. 
including recombinant retroviral vectors. Watanabe and 
Temin (1982) have provided evidence for a similar site in 
an avian retrovirus. 

Results 

Construction of pMOV-f 

To examine the functkxi of sequences upstream from the 
start of Pr65*w and downstream from the env mRNA splice 
donor site in M-Mii.V. we deleted these sequences from 
a cloned DNA representatk)n of the genome. As detailed 
in Figure 1 . the deletkxi was made from a Bal I site to a 
Pst t site, and a Hind Rl site was generated at the point of 
deletkKi. Based on the published sequence of M-MuLV 
(Shinnick et al.. 1981), the size of the deletion was 351 
bp. The 5' deletkxi endpoint was only 6 bp from th6 
presumed dorxx site fa the env mRNA spice, and the 3' 
deletk)n endpoint was approximately 50 t>p from the start 
codon for Pr65^ (Figure 2). The resulting deleted plasmid 
is termed pMOV-^". 

Virus Spread Is Defective after Transfection f 
pMOV-^- 

To determir^e whether pMOV-^~ generated transmissft^e 
virus, it and its wfld-type counterpart, pMOV-^*, were 
irxjeperxlentty transfected into N8-f^T3 ceQs, and reverse 
trar^scriptase in the culture supematants was assayed over 
time. By 4 days after transfectnn. the mec&im from a 
cuftue transfected with pMOV-^* had 3-fokJ more reverse 
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Rgure 1. Coostruction of pMOV-^" 

peR322 and SV40 sequences, are represented by thin lines, MuLV se- 
quences are represented by ttiick Itr^. arxJ mouse sequerces flanking the 
provirus are represented by wavy tines. Ttie kyng termlrwl repeats of MuLV 
are indicated by open boxes. (VS: intervening sequence of SV40 smaft 
tumor antigen. 
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Figire 2. Schematic Representation of the Provirus Present in pMOV-^" 
The long terminal repeats are represented as dosed boxes, and interrvtt 
MuLV sequences are represented as a thin line. The 350 bp present 
between the Bal I and Pst I sites were defeted arxJ a Hind B site was 
introduced at the site of deletion. This new Hind 0 site is six nucleotides 
from the apparent dorvx spfice site for the errv mRfSlA and 55 nucleotides 
from the ATG for Pr65«(Shinntck et aJ.. 1981). 

crease in activity as a result of efficient spread of virus 
from the riitialty transfected ceils to all cells on the plate. 
In contrast, for 4 to 6 days following transfection of pMOV- 
no detectable reverse transcriptase activity was ob- 
served in the medium, suggesting that virus spread was 
initially defective. Uninfected cells had less than 5% of the 
activity of MOV-1. 

Within 9 days of transfection of pMOV-^", large amounts 
of reverse transcriptase activity developed in the culture 
supernatant (Figure 3). suggesting that virus spread even- 
tually occurred. The reverse transcriptase activity present 
by day 9 could have been due either to the slow, atten- 
uated spread of mutant vinjs or to the generation of a 
nondefective virus by recombination with a cellular se- 
quence capable of providir>g the missing function. To 
differentiate between these possibilities, various dilutions 
of culture supematants from cells transfected 9 days 
previously with pMOV-^* or pMOV-^~ were used to infect 
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Figure 3. Reverse Transcriptase Activities of Cutture Supematants as a 
Function of T^ne After Transfection with Either pMOV-^* or pMOV*^" 
Sofid dots represent background activity of untransfected NIH/3T3 cells. 
Thickfine: pMOV-^". Thin tine: pMOV-^*. Al reverse trarwcriptase activities 
are represented as a percentage of the activity produced by a standard 
MuLV producer cefl ine. MOV-1 . 

NIH/3T3 cells. High levels (3-4 times MOV-1 activity) of 
reverse transcriptase activity were observed 2 days after 
infection with only 10 /d of both supematants (data not 
shown), demonstrating the presence of nondefective virus 
in the culture supematants of cells transfected with pMOV- 
^~ 9 days previously. 

pMOV-^~ Provides All Viral Functions Required in 
Trans 

To determine whether the deletion made in pMOV-^~ 
affected functions required only in cis by the virus, helper 
activity was measured by using a recombinant retro^/irus 
genome derived from Moloney sarcoma viais (MSV) ca- 
pable of expressing the E, cdi gene encoding xanthine- 
guanine phosphoritx)syttransferase (XGPRT) (R. Cone. R. 
Mann. D. Baltimae. and R. C. Mulligan, unpublished data) 
(Figures 4A, 4B). This gene, designated gpt, is a dominant- 
acting selectable mariner in nrtanrvnalian cells (Mulligan and 
Berg. 1980). The plasmid (pMSVgpt) encodes no intact 
retroviral proteins yet retains the region deleted in pMOV- 
pMSVgpt also has the polyonna virus early region to 
increase its copy number in NiH/'3r3 cells eariy after 
transfection. To assess the helper activity of pMOV-^". 
we cotransfected pMSVgpt and a 10-fold excess of either 
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Figi^e 4. Structure of pMSVgpt and lis Use to 
Assay (he Helper ActMty of Retrowtf Constructs 

(A) Recornbinant plasmid pMSVflpt is shown. Se- 
quences derrved from retroviruses are ndfcated 
by thick ines. with the long termini repeats (LTRs) 
as fifled boxes. The ffp* gene (MuBigan and Berg. 
1380) is indicated by an open box. Mink ceOUar 
DMA flanking the recombinant retrowhe is ix*- 
cated by a wavy bne. The potyoma vrus earty 
region (Magnusson and Nfcon, 1977) (base paw 
2932 to 4632) is hdkrated as a dashed frie. and 
p6R322 sec^jences derived Irom the "poison rrt- 
nus" derivative (Lusky and Botch^i, t98l) are 
indicated by a thin frw. - . 

(B) Structire and origin of the provirus se^nent of 
pMSVgpt showing that no intact retrovvd genes 
are present. The reason tor indudkig the )(ba I to 
Xba f fragment of MM<-V in pMSVgpt is fretevant 
to the present use of the ptasmid. The MSV restric- 
tion map is from VandeWoudeetal. (1979). 

(C) Protocol for assessing the helper activity of 
retroviral constructs. 
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pMOV-,;.* or pMOV-^- into N1H/3T3 cells (Figure 4C). After 
24 hr, culture supematants of the transfected ceils were 
used to infect a second plate of NIH/3T3 cells. Two days 
following the infection, selection pressure for XGPRT was 
applied, and 8-10 days later gpt colonies (colonies con- 
taining the gpt gene) were counted. pMOV-^^* yielded 2.5 
X 10* gpT colony forming units (cfu) per milliliter of culture 
supernatant, and pMOV-if-- yielded 5x10^ gpt cfu per 
milliliter (Tat)le 1). No gp/* cfu wer6 detectable when 
pMSVgpt was transfected by itself. Thus pMOV-^- could 
efficiently provide all functions in trans required for viable 
virus production. 

Construction of Cell Unes With pMOV-*" Stably 
Integrated and Expressed 

To explore further the properties of pMOV-^'-. we decided 
to establish cefl Ones containing the mutant stably 
integrated nto the chromosonne. These lines were made 
by the cotransfection of pMOV-^- and pSV2gpt, a SV40 
hybrid vector capable of XGPRT expression (Mulfigan and 
Berg, 1980). Cells from gpf* colonies obtained in this way 
were doned and estabfished into 3 lines. ^1, ^2 and ^ 
3. AO three fines had detectable reverse transcriptase 
activity in the culture si4)emalant (Table 2). indicating that 
^ afl contained sufficient MuLV infomiation to express 
and bud reverse-transcriptase-containing particles. Only 
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Table t. Assay of helper activity of pMOV-f* and dMOVi^- 


Helper Plasmid 


ffpf*(cfu/ml)- 


pMOV-^* 


2.5X10* 


pMOV-^- 


5x 10> 


none 


<10 


'T^ ^ minslected oel aitue supema- 

tant 24 hr after transf ection. 



ml of culture supernatant from either ^1 or ^-2 onto NIH/ 
3T3 cells yielded no reverse transcriptase producer cells 
even after 3 weeks of culture. Thus the f-3 cells were 
producing a nondefective viais. wherease the ^1 and <^ 
2 ceils were not. 

Viral RNA in f -2 Cells Is Poorfy Packaged int 
Virl ns 

Because the r mutant appeared defective in a function 
requ^ed only in c/s, we suspected yfr RNA of missing 
sequences innportant fa its packaging into virus partide^ 
To address this questk)n directfy. total cefiular RNA was 
purified from the MOV-I (an f^mi.V producer) and 4^2 
ceB lines, and viral RNA was exfracted from purified parti, 
cles released from the MOV-1 and ^2 cells. The MOV-1 
and yh2 viral RNAs were derived from virion preparatkxis 
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Table 2. Properties of cefl lines stabty transfected with pMOV-V*' 

Reverse gpf* (clu/rnl) 24 

Transaiptase hr after 

Activity in Transfection with 

Helper Une Siipematanf Infectrvrty* pMSVgpT 



MOV-1 


100.0 


+ 


5x 10* 




2.0 




GO 




35.0 




10* 




13V4 


+ 


icy* 


NM/3T3 


0.9 




<10 



*lri SBtttrary units with the wild-type MuLV producer tine, MOV-I. set equal 
to 100.0. 

*'lnfectivrty was judged by infecting NIH/3T3 cefls with 1 ml of the cutture 
si^iematant and measuring reverse transcriptase activity fa at least two 
weeks fdowing the infection. 

*0pf^_colony fomning units per tnBHAef of ctiture superrtatant of ceBs 
transf ected with pMSVgpL 

activity and p30 protein by measuring their concentration 
in portions of the purified particles just prior to RNA 
extraction. Both cellular and viral RNAs from MOV-I and 
^2 were fractionated by electrophoresis through a 1% 
agarose-formaldehyde gel. transferred to a nitrocellulose 
filter, and prol^ed with a nick-translated plasmid containing 
the entire M-MuLV genome (Figure 5A). Full-length ge- 
nomic RNA could be detected in MOV-1 cells (8.2 kb, lane 
1) and in ^-2 cells (7.9 kb. lane 4), as could the subgenomic 
2.9 kb env mRNA (lanes 1 and 4). The 8.2 kb genomic 
RNA could be readily detected in RNA preparations from 
MOV-1 virions (lanes 2 and 3). but no hytiridizaWe RNA 
was detected in RNA preparations from an equivalent 
number of ^-2 particles (lanes 5 and 6). In Figure 5A. lane 
6 contains RNA from 10-foid more ^-2 particles than were 
used fa the MOV-1 particles in lane 3. Also, exposure of 
the autoradiogram shown in Figure 5A for a 10-fold lor>ger 
time stiQ revealed no hytxidizable bands in lanes 5 and 6 
(data noi shown). Thus partk:les released from ^-2 cells 
contain at most 1% of the viral RNA found in MOV-1 
p>artides. 

Further quantitative data were obtained by spotting 2- 
fold dflutkxis of MOV-1 and ^-2 viral RNA preparations 
onto a nitrocellulose filter and probing it with a nick- 
translated plasmid containing the entire MuLV gerxxne 
(Figure 5B). The amount of RNA spotted was again nor- 
malized to the number of particles present prior to RNA 
extraction by measuring reverse transcriptase activity and 
p30 concentration. From this analysis, it appeared that ^" 
viral RNA was packaged into vims particles with an effi- 
ciency much less than 1% that of wild-type viral RNA. 

^-2 Can Efficiently Package Other Retroviral 
Genomes into Infectious Particles 

We next examined whether the pMOV-^~-containing cell 
lines coukJ package a highly defective retrovirus genon>e 
into virus partides and bud the particles into the culture 
supernatant without also budding helper virus. To this end, 
pMSVgpt was transf ected into ^-1. }P-2, ^-3. MOV-1 and 
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Figure 5. Hytxxfization Analysts of CeSUar and Vral MuLV Fth4As From 
MOV-1 and ^2 CeBs 

(A) Pofyadenyfated cebiar RNA and viral FQsiA from MOV-1 and ^2 were 
fractionated by electrophoresis through a 1% agarose-formaldehyde get. 
transferred to a nrtrocediJose filer arxJ probed with a nicK-translatedptasmid 
containing the entire MuLV genome. (Lane 1} 3 ftg polyadenytated MOV-1 
celkiar RNA; (lane 2) viral RNA from puified MOV-1 particles: (lane 3) 10- 
tdd the MOV-1 viral RNA present in lane 2: (lane 4) 3 polyadenylated 
^-2 ceOular RNA: (lane 5) viral RNA from purified 4^2 partides: (lane 6} 10- 
fotd the f-2 viral RNA present in lane 5. Lanes 2 and 5 ccrUain RNA 
extracted from equivalent numbers of MOV-1 and ^-2 partides as deter- 
mined by both reverse transcriptase activities ar>d p30 concentrations. 

(B) Sequential 2-fold dilutions of viral RNA obtained from puified MOV-1 
virus or f -2 particles were spotted onto a nitroceiiiose fiter and probed 
with a nick-translated plasmid containing the entire MuLV genome. As in 
(A) the anxxjnt of viral RNA used was rvymalized to reverse transcriptase 
activities and p30 concentrations present in the paftides prior to RNA 
extractjon. 

NIH/3T3 cells, and 24 hr later the cutture supematants 
were used to infect fresh NIH/3T3 cells. Two days after 
the infection, selection pr ssure for XGPRT was applied, 
and 8-10 days later gpt* colonies were counted. CeD line 
^-1 produced only 60 gpf* cfu/ml of culture supernatant 
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but ^-2 yielded 10* cfu/ml (Table 2). Mov-1 and f-3 
produced 5 x 10* and 10* cfu/ml, respectively. Further- 
more, cells rendered gpt* by using supematants from 
transfected ^-1 or ^-2 cells were producers of neither 
reverse transcriptase nor gpf"^ cfu and remained negative 
for both after 4 weeks of culture. In contrast, cells rendered 
gpr using supematants from transfected MOV-1 or ^-3 
cells were immediately producers of reverse transcriptase 
and gpt* cfu. 

Discussion 

By deleting 350 bp from a M-MuLV DNA clone, we have 
created a mutant, pMOV-^", that cannot itself be propa- 
gated as a virus, but that still efficiently provides all of the 
otrfigate trans retroviral functions. Upon transfection of 
ctoned recombinant retroviruses, such as pMSVgpt. into 
cefl lines expressing pMOV-^". such as ^-2, high titers of 
infectious retroviral particles are produced which contain 
the recomtMnant genome. These particles are able to 
transmit the recombinant genome efficiently via the retro- 
virus life cycle. In contrast, the ^" genome is not detectabty 
packaged into particles and therefore helper virus is not 
produced. Thus the defect in pf^V-^" appears to be a 
c/s-actrve defect and probably involves a packaging site 
on the RNA. presumably one recognized by a viral protein, 
rather than a region encoding a protein needed for pack- 
aging. Deletion of sequences in spleen neaosis virus at a 
similar positkxi to that described here has been shown to 
result in a c/s-active defect (Watanabe and Temin. 1 982). 

taurine retroviruses often encode a glycosylated form of 
their Pr65^ protein that is longer at the N-terminus than 
Pres*^ (Edwards and Fan. 1979. 1980). The pMOV-^* 
clone contains a deletion of ail of the information that might 
erxxxJe such an N-terminal extenston and yet provides all 
of the needed helper virus functions. The 350 bp deletkjn 
also removes all three AUGs that are present between the 
5' LTR and the AUG used for Pr65*^ initiation. It would 
thus appear that, at least in fibroblasts, the glycosylated 
gag precursor is not needed for efficient budding of 
infective progeny, although this has not been tested di- 
rectly. 

Dimerization and Packaging of Virion RNA 

Electron microscopic studies of 70S virion RNA have 
suggested that a site approximately 300 nucleotides from 
the 5' end of the 35S RISIA molecule may be important for 
genome dimerizatkxi (Bender et al.. 1978). This regk)n is 
within the 350 bp deletion of pMOV-^*. If din)erizatk)n is 
necessary for packaging, then the defect in pMOV-^' may 
be a consequerx» of the deietkxi of the dimerizatk)n site. 
Studies by Cheung et al. (1972) and C^anaani et al. (1973). 
however, suggested that the packaging and dimerizatkxi 
processes are at least temporaDy (Sstinct. because they 
found that freshly budded virions of Rous sarcoma virus 
<»ntained onfy the monomer RI^ and that cfimer formal 
foflowed budding. Smaller mutations wiH have to be con- 

Stnirt#v< tn ovamino whothor naHranirvi anri drmoriTafw 



Generation of Nondefecthre Viruses 

When pMOV-^- DNA is transfected into mouse cells, 
nondefective virus appears after a lag of 4 to 6 days. The 
slow appearance of nondefective virus, and the ease of 
isolating cloned cell lines that hartor pMOV-f " txit do not 
generate nondefective virus, suggest that only a minority 
of transfected cells yield nondefective virus. We assume 
that recombination b>etween some cellular sequence and 
pMOV-^--derived DNA or RNA is responsible for the gen- 
eration of nondefective virus, but we have yet to provide 
direct evidence for this. Ukely candidates for providing the 
packaging site sequence are the family of abundant, 
packagable, virus-like 30S RNAs found in mouse cells 
called VL30 RNAs (Shenvin et al.. 1978; Besmer et al 
1979; Scolnick et al.. 1979) and the many retrovirus-like 
DNA elements found in normal mouse cells (reviewed in 
Coffin, 1982). We are currently analyzing the structure of 
the nondefective viruses as well as determining whether 
these genomes can be generated in other cell types. 

Use of 

In a number of viral systems it has proved useful to design 
cell lines capable of providing certain viral functions. Such 
cell lines can then be used as hosts for growth of viruses 
defective for production of the proteins already present in 
the cells. Fa instance, the hr-t mutants of polyoma vims 
were isolated in this way (Benjamin. 1970). earty mutants 
of SV40 can be grown in COS cells (Gluznian, 1981). and 
eariy mutants of adenovinjs can be grown in 293 cells 
(Graham et al.. 1977). The ^2 cell line described here is 
an analogous cell line for defective retroviruses. Wrth such 
a line, stocks of natural or engineered 'defective retrovi- 
ruses, such as pMSVgpt and others (Mulligan. 1983). can 
be obtained totally free of detectable helper virus. This tool 
enhances the utility of retroviral vectors (Wei et al.. 1981- 
Tabin et al.. 1982; Mufligan. 1983) becau^ pure stticks of 
the engineered viaises can be used to introduce genes 
into celts via the retroviral life cycle without the recipient 
cell becoming a retrovirus producer. FurthemrK)re. genes 
cloned into retroviral genomes could, in principle, be intro- 
duced into the germ line of mrce without also introducing 
helper viajs. in much the same way as has been demon- 
strated for MuLV (Jaenisch, 1976. 1977; Jaenisch et al.. 
1981). thus avoiding skie effects inherent in having helper 
virus present 

Experimental Procedures 
CeOo ond Vbuseo 

The t&i/m ceO Ene was grown in Dutoecoo's modified EagJe mecfijn 
containinQ 10% caff serum. The MOV- 1 ceOEne was derived 
of pMOV-^* (actuaSy pMOV-9, a gif! from R Jaenisch) rto HH/3T3 ceSs 
foSowed by dortng. CeOs to be cloned were Irypsintzed. counted, and 
seeded into S&weS dormg trays at 0.3 ceSs per weS 0" a vo^lffne of 0.2 
mr per wea) and aSowed to grow lor 10 to 12 days. SeSection rnecfwn for 
ffpT was as descrftjed (Mi^gan and Be^g. 1981). Virus infections were 
pertorrned in the presence of 8 MQAm polytrene (or 1-2 fv. 

RovGTDO TrEmecripi2ese Aoscyo 

Reverse franscrptase assays were pertormed as desatoed (Goff et ai.. 
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DMA Tranof ectkm 

Twenty-lour hours atter 10^ cells were seeded on 100 mm piates. the 
cultijes were iransfected wrm 10 MO of piasmxJ DNA by usin^ 
of Graham and Van der Eb (1973) as nxxfified by Pafker and Stark (1979). 
tn cases where selection fa transfected ceOs was appfied. ceOs were sptit 
1:20 into selective media 48 hr after transfection. Unsetectabte DMA was 
cotransfocted in lO-foJd motar excess over 1 i/q of setectable OIMA. 

Preporatkm ond Anoljroie of RNA 

Total ceCutef RNA was prepared by the method of Orrgwih et al. (1979), 
and pofyadenylated RNAs were seteded with oJjgo(dT>ceflulose as de- 
SCTibed by Ouintrefl et al. (1980). Virion RNA was prepared by extraction 
from vinisparticies banded in sucrose step gracfients (Shields etal.. 1978). 
RNA was size-fractionated by eiectrophorests through agarose-formaide- 
hydegels (Maniatis et al., 1982). transferred to retroceflutos e . and hybridized 
with *^-labe«ed04A probes as described (Rigbyetal., 1977). Quantitation 
of RNA by dot btots was done as described by Thomas (1980). 

Conotructton of pSMV-i^' 

Ttffee purified DNA fragments were Egatod together to constiud pMOV-^" 
(Rgure 1). The first was obtained by digesting pMOVif* (Owmakov et al., 
1982) with Xho I to completion, fo3owed by partial <£gestion with Eco RI. 
The fragment extencfng from the Xho I site al 2.0 U in MuLV. through the 
3' LTR. 3' mouse flanWng sequence, all of pBR322, and entSng at the Eco 
RI site was purified from an agarose gel after electrophoretic separation 
(Vogebteen and G3espie, 1979). The second fragment was obtained by 
<*gestion of pl^-^* wi&i Ball to oompJetion foflowed by purification of 
the fragment extencfing from the Bal I site in peR322 through 5' mouse 
flanking sequence and 5' LTR to the Bal I site kx:ated at 0.7 U of- MuLV. 
Mnd 0 linkers (CoQaboralive Research) were then bhjnt-ligated to this 
fragment with T4 DMA Bgase. and the fragment was <Sgested with excess 
Hind a and Eco RI. The. LTR-containing fragment was purified from an 
agarose gel atter electrophoretic separation. The third fragment present in 
the final figation reaction was obtained from pSV2-gag/po/ (R. hAi^gan, 
urpubSshed plasmtd) where the gag/pol region of MuLV had been sub- 
doned into pSV2 (MuSgan and Berg. 1980). pSV2-gag/po/ was digested 
to completion with Xho I and Wnd □ and the fragment extending fro(n the 
Knd a site (changed from the Pst I site at 1.0 U of MuLV) to the Xho 1 site 
at 2.0Uof MuLV was purified from an agvose gel foflowing electrophoretic 
separatioa These three Of^ fragments were then mixed in equimotar 
amounts at a total ONA concentration of 50 ftgM txiffer (50 mM 

Tris+ta [pH 7.8), 10 mM MgO,. 20 mM <fthjothreitol. 1 .0 mM ATP, 50 ^9/ 
ml kx>vine serum atoumin) arxj incubated with T4 DNA Ggase for 18 hr at 
15*C. E. co5 KB101 was transfected with the figated DNA, and ampid3in- 
reastant transfectants were obtained. The plasmid DNA obtained from a 
number of transformants was screened for the desired structure by (figes* 
tion with appropriate restriction endonudeases and electrophoresis through 
agarose gels (Davis et al.. 1980). 
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Activating point mutations In th common p subunit 
of the human GIVI-CSF, IL-S and IL-5 r c ptors 
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and cell type-specific molecules in signalling 
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We have combined retroviral expression cloning with 
random mutagenesis to idenUiy two acUvaUng point 

JPc) or the rmptors for human granHlocyte-macn». 

facinr ^IJ^^^ *"""'y to confer 

n?p. JS?^""". haemopoletic cell line, 

roC-PI. One mutaUon (V449E) Is located within the 

X 5 .. oncogene, may act by Inducing 

tiZ^T " ."^ "The other mutaUon (1374N) iS 
fL w!^?"""?'""' 'ne"'bn»e.proximal portion of 

7"^"' ""P* ^""^ ^hlch hai a small 
1/ J?^**? * traceUular region), was capable of 
Indudng factor independence In CTlU ceilT while 
SS'^'^T?"" "H'uce factor independLS S^! 
^^J^ results Imply that the extnweUutor 
JJtaSTL?''"™ niutaUons act by different mech- 
anbmi Furthennor^ they Imply that the mutants, and 

SS/kT'!?"- presented VSS 

some of the characteristics of the putaUve rewDtor- 

associated signalling molecules. 

i^jwrtte cytokine receptoK'superfamily/human GM-CSF 

Chain/oncogenic activation/poly- . 

"""»«««is/ret«.viral exp««lon 



Introduction 

SS"I^?S"^?'"£^ coIony-stimulaUng factor (GM- 
CSJ) and ncerieulun (IL)-3 stimulate the prolifefation 
diffcrenuauon and functional activity of a wiK«y of 
haemopoicuc cells, including ncuin^hiis. eosiSls 

eLSon^f *« proliferaUon of 

eosinophil progenitors is also shared by IL-S (reviewed 
by Sandenon. 1992). n,!, function^ overlap, as weTa^ 

strf^^""*^.^''" ^ ' '"^o" n 'h" 

surface of human haem pOietic cell, (Upez « a/.. 199?)! 

is n^L^o^^'ll" ''^ composition o 

ihe receptors fo> these three CSFs. Tht hiVh-affinity 

427i6 



and «L-5 (hlL-SR) are composed of ligand-specific 
subuntu (hGMRo. hIL-3Ra and hIL-5Ra) associWi wiS il^ 
a common p subunit (hpc). TTie o subunits binTthS 

w!^ °"r*i!^ '"'"''y- "Pc "tone does 

not bind any of these factors detecubly. buVis requiiS^" 

m association with the a subunits. to confer high^ffiS; ^ -1 

bindmg (Geanng « «/.. 1989; Hayashida ei aL, |«SS^ - 

Kuamunt«a/ 991a: •ftvemiere/a/.. 1991). Morim! 

hPc js e«cn„al for rig,«l tnm«luction (Kitamum et^M 

991b. lOtamuni and Miyajima, 1992; Sakamaki « a/. " ' 

1992) and. as a shared signal-tnuisducing component ^ 

P|ov,des a molecular explanaUon for the overlapS ^ 

biological characteristics of GM-CSF. IL-J and IL-5 - 

Both subuniu o; the receptors for GM-CSF. IL-Vand ■ 

nCL^ ambers of a large family, often termed the r 

1992 that includes the receptors for many haemopoietic 
gmwth factor. Receptors of this class arJ chamcSSd 
by a conserved structure (including the hallmark WSXWS 
SlT 1f"I'P"^"i '""'•'^ i^^'^ extracellular 
Uien refined by detenmination of the structure of the 
complex between human growth hormone (hGH) and its " ^ 
receptor (hGHR) (de Nfes e, al, 1992). Unlike the othS ' 
major class of growth factor receptors, the receptor tyrosine ^ 
kinases, members of the cytokine receptor family do not ' 
appear to possess any intrinsic enzymatic activities that ''"^ 
could account for intracellular signalling 

Despite the fact that B subuhits and their associated a : 
subumts lack an intrinsic tyrosine kinase acUvity. it has 
been thoroughly documented that tyrosine phosphorylation 
of a number of proteins, including the p subunits them- t 
selves, IS rapidly induced following ligand bindine 

ei al 1993). At least two distinct regions of the intnicellu- 

^Ll.T'"" °^ !" the generation of . 

asparate 'ntiacellular respbnses (Sakamaki ei al., 1992; 
Sato ei aL, 1993). which impBes that hpc is associated 
with and utilizes multiple effector molecules. Indeed, 
phosphorylation is believed to be carried out by one or 
rnore receptor-associated tyrosint kinases such as JAK-2 • 
SS?v"T" " (Hanazono el al., 1993) 

^tr^T^'V' "^2)- mechanisms 
rLr^J-. • njolecules are activated are not 

cl«r. While It IS accepted that signalling is mediated by 

T**?'"* ^ the o and P 

subunits neither the precise stoichiometry nor the role of 
each subunit in signalling is clear. On one hand, the 
cytoplasmic poition of the a subunit is required for 
e,^TX\^ ""^ '^-^ receptorTcSakamaki 

^L. i " "95= Polotskaya « al., 1994). 

S.^ «='"'"«*c receptors suggest 

lliat dimefizaupn f the intracellular portion of the P 

Oxf rd UnWertity Press 



subunii may be sufficient t iniiiaie signal iransduction 
(e.g. Sakamaki */ a/.. 1993: Takaki ei al.. 1994 sec 
Discussion). * 

Regardless of ihe precise mechanism by which thev 
function, the GM-CSF. IL-3 and IL-5 receptors transduce 
proliferative signals, and it would Iheref re seem likely 
that their common signalling subunit. hpc. represents 
a potential target for oncogenic activation. Indeed the 
oncogenic potential of several members of the cytokine 
receptor supcrfamily has already been demonstrated: the 
v-m;,/ oncogene of the myeloproliferative leukaemia virus 

;^ Z^^ wn^" " °f *e recently charac- 

f^^-^^PL receptor for thrombopoietin (Soujri ei al., 

T 'I ' " '99^)- % single 

ammo acid substitution in Ihe extracellular domain of Oie 
efythropoietin receptor confers factor-independent growth 
and tumongenicity on factor-dcpendent haemopoietic cells 
(Yoshimura et al, 1990). More recently, a constitutivery 
active mutant of hPc (HA), that aro« spontanTu K 
following retroviral transduction, was isolated by virtue . 
of iLs ability to confer factor independence and tumori- 
*f 1^ haemopoieUc ceil line (D' Andrea 

Here we report the identification of two activating point 
mutations in h^c that were generated using a polymerase 
Cham reaction (PCR)-based random mutagenesis pro- 
cedure. By combining this procedure with a retroviral 
expression cloning system recently developed in our 
laboratory (Rayner and Gonda, -1994). these mutanu 

■nt ^.!K-r.°"r' .'^»"o^-<'«l«ndent haemopoietic cell 

me. The abi ity of poim mutations to activ 
the possibility that such mutations could comribEte to 
human leukaemia. In addition, the nature and properties 
of Ujese muumts also have important implications f^ the 
mechanism of signalling by the receptors that utilize the 
npc subunit. 



Results 

Generation of a retroviral expression library of 

point-mutated fifie eDIM construed 

To investigate the oncogenic potential of hpc. we intro- 

of the hPc cDNA encoding the 191 membrane.|*oximal 
Z °I iransmSnbrTe 
(rT^ Ta?w ""f^ "sidu^ of the cytoplasmic domain 
LZ^ol^ .- "S™"* « « potential 

target for activaung mutaUons because it encompasses 
several sequences that have bebn implicated in signalling 
l^IT'T? °^!'y'°^'«^ -^eptors: (i) the meSbrane! 
,T„ /n^AT'"""'? •'*8ion essential for signal transduc- 

TJ l^'^ '! ^"^^ " '992: Sato 

wsi^i? = ''993): (ii) the highly conserved. 

WSXWS sequence that is a hallmark of the cytokine 

e'^Shfi!!'*'^^"^ f"'> equivalent 

extracellular region of c-MpI remaining in the v-Mol 
oncoprotein (Souyii e, at., 1990; Vigon ei al., 1992)- 
^ «"!"««'''"'>' sequence duplicated' in the 

« a/ llS ^'^""^"^ 
Our approach to identifying.activating mutations in hBc 
>wast c "Struct a bank of random point muunts and thS 
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Fig. I. Localion of aclivaiing point mulaiions in the region of the B 
subun.1 of the human GM CSF/IL-J/IL-J «cep.on abkcted ,o 
random muiagenesij. (A) Schematic illustration of hBc showin. the 
«wo cytokine receptor modules (CRMs: Coodall ti «X 1993) the ' 

WSXWS moufs (thick venical lina) (tee Bazan (1990) for deseription 

. "1'"'""' Pn>lifc«iive signalling 

"1 the cytoplasrow: domain (Sakamaki « a/.. 1992) (black shadina 
The posuioni in the hpc cDNA of the ;rA»l v,A Bglll sites tf» delimit 
.he fragment used for mutagenesis a« shown und^th. (BU«, fo" 
of pomt muiatrons in activated fonns of h|}c. IT* Mructural feai.v, of 
hpc are illustrated as in (A). TV sequences of the regions coritaini'ng 
mu alton, in mutant, I and 2 (see text) are espandedfthe Sing* 

muiani 2 ts underlined. 



mscrt these en masse into a retroviral expression vector 
lUe resultant retroviruses were then used to infect a 
munne factor-dependent haemopoietic cell line (FDC-Pl) 
following which these cells were selected for the ability 
to grow in the absence of factor. TTiis procedure, which 
IS outlined in Figure 2. is essentially a combination of the 
I TliSS!""'"' technique described by adwcll and 
Joyce-(l992).and methodology previously developed in 
this laboratory (Rayner and Gonda. 1994). 

As described in Materials and methods, we were able 
to define PGR conditions, based on tiiose described bv 
adwell and Joyce (1992). tiiat resulted in the unbiased 
random generation of mutations at the desired rate of 
70.2% (I intfOO bp). The mutated hpc fragments were 
inserted direcUonally into pRUFNeo-hpc (see Materials 
and methods and Figure 2) from which the segment 
subjected to PGR mutagenesis had been excised TTiis 
resulted in the generation of a library, comprising I 7x I0» 
plasinid clones, of hpc cDNAs bearing point mutations in 
Ae targeted segment only (hpc™-). Using procedures ' 
described previously (Rayner and Gonda, 1994) the 
plasmid DNA was used to eenerate a retroviral library 
estimated to contain 2.1X10^ independent viral producer 
cl nes. Assuming that there was no verwhelming bias in 
the procedure, tiiis should adequately represent all of the 
possible point mutations in the 940 bp hpc fragment 
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Xhol 
-I 



Bglll 
— I-- 



hpccDNA 

4 

PCR mutagenesis foil wed by 
Xh I/Bglll digesU n 

4 

Insert pool of mutagenized fragments 
Into retroviral vector carrying 
remainder of hpc 

Introduce into retroviral packaging 
cells 

Select for G418 resistance 

Use pooled vinis-produdng packaging 
cells to Infect target cells (FDC-Pl) 

Screen for factor-Independent growth 

4 

Recover and sequence mutated 
h3c fragments 

hfc'i^ll''^,^' ««n«egy used for gcntmion and exprtssion of 
.^JT^ "°P ''I*""" «^ Primen used for PCR 

tmphtouonAnuugeneris of ihe segroem of hf)c Zwn in Rbu^ | 

SSS^nTw-i^ been dig«ied with BglU ind Xho\. Tht itwIttnT 
po^ofplMaudi was introduced into <W «„o»i™i packMing celuL 
• '*«^^«'»'yR«yn"«»d Good. (I9M). Allowing SKLtia^ 
which wtre subsequendy selected for fMor indepewlence Tht 

/5o/«tfon offaetor-independent FDC P1 cells 
^ng constitutivelY active hpc mutants 

nr if 1 ."^ Jtependcnt on mouse GM-CSF (mOM- 
^v. kL ^""^^^ for growth, and normally die within 
days when starved of cither factor. Howev„. Aey do 
GlSlS?n!riK"p^ «'"«n««tions of human 

n.'^l ^Sw;P'^ °' <'''l-3) «f the appropriate 
four ."!„"" 'j"-^'*°> " co*xP«ssed with hPc 

our^unpubh^ results). We therefore reasoned th« 

SSen^Jfr °f "Pc should induce factor! 
mdepCTdentproIiferation: this approach has been validated 
. ^ consUtuUvely acUve hBc 

expression library. FDC-Pl cells Ty ^- 

4278 



cultivaiion with the pool of hpc-"" virus-producine 
cells (sec Materials and methods). Parallel co<uUivatl ns 
were also performed with uninfected ^2 cells and «P2 
cells producing wild-type h^c rcirovinis. After I week in 
liquid culture in Ihe absence of factor, all four FDC-Pl 
cell pools infected with the mutant hpc retroviral library 
contained viable, proliferating cells, while no such cells 
were present in the control cultures. Factor independence 
was not a result of autocrine growth factor production as 
conditioned medium from the factor-independent cell 
pools did not support the growth of uninfected FDC-Pl 
cells (data not shown). A total of 31 factor-independent ' 
clones was isolated from the factor-independent linuid 
culture pools by agar plating: each of these was analysed 
further. Based on infection frequency and the number 
of independent mutants (detected by sequencing and/or 
restnction analysis: see also below and Figure 7) wo 
S'^Mniy"'''"'"*''' °^^''*='°'-'"''<='*"<lc"'.'nutations to 

Identification of activating mutations in 
eonstitutively active lifie mutants 

During the course of recovering the mutated region of 
ftpc from the factor-independent clones by PCR it was 
discovered that 22/31 clones contained an additional Bglll 
rcsmcuon site (as shown in Figure 7). Sequencing of Snc 
such clone (mutant I) revealed that this was due to a T 
to A mutation at nucleotide 1374. which results in a 

^ ^^tl^li..*'""" transmembrane domain (see 
Figure IB). TTjis is strikingly reminiscent of the activatine 
point mutation (V664E) found by Baigmann e, at. (1986) ' 
in the rat neu oncogene (see Discussion). Sequence ana- 
lysis of one other clone that lacked the extra BgRl site 
(mutant 2) revealed two point mutations: an A to G 
substitution at niiclcotide 1112 which resulted in the 
subsutuoon of lysine 362 to glutamic acid (K362E). and 
a T to A substitution at nucleotide 1149 which resulted 
in the substitution of isoleucine 374 to aspaiagine (I374N). 
both of which he fn the exuacellular portion of the targeted 
region (see Figure IB). cuugeieo 
To confirm that die V449E mutation was indeed capable 
of acuvating hpc. and to identify which of the two 
muutions in mutant 2 was responsible for activation, all 
three mutations were re-created indepcndcnUy by site- 
directed mutagenesis. We also constructed two additional 

knowntT^ "'^ <"her mutations 

known to activate c-it« (BargmSnn and Weinberg. 1988- 
see also below). Following insertion into the pRUFNeci 
retroviral vector and transfection into H'2 tftckaeine cells 

mPM " ""Ji"™ without 

^Jjol ? °l '''"'^ efficienUy generated 04 18- 
rcsistant cells, which were subsequendy analysed for hPc 
expression by antibody staining and flow cytometiy. In 
Uiesc and subsequent experiments, we also included die 
previously described activated hpc mutant FIA. which 
contains a 37 ammo acid duplication in the membrane- 
^rT extracellular domain (D' Andrea 

It k^}' T": I • substantial proportion 

except dial carrying die V449D mutant, expressed hpc 



Constitutive ectivotlon of K^c by point mutation 
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Fig. 3. Row cytometric analysis of wild-type (wi) and mutant hfk 

expression on infected FDC-PI cells. Dotted lines ( ) represent 

cells stained with an irrelevant control antibody and solid lines ( ) 

indicate staining with anti-h^c antibody. Cell number and fluorescence 
are in arbitrary units; the latter is plotted on a logarithmic scale. 

(A) Cells selected for resistance to G4I8; also shown are analyses of 
uninfected cells and cells infected with the pRUFNeo vector alone, 

(B) Cells selected for growth in the absence of factor. 

(Repeated attempts lo obtain surface expression of V449D 
were unsuccessful.) However, only the I374N, V449E and, 
as expected, FIA mutants conferred factor independence on 
the FDC-Pl cells; the K362E, V449Q or (not surprisingly) - 
the V449D mutants all failed to do so (Figure 4A; the 
data for the non-expressing V449D mutant is not shown). 

Although the degree of proliferation in the absence of 
factor shown by cells infected with the V449E, I374N 
and FIA mutants was lower than that seen with factor, 
this probably reflects the fact that only a proportion of 
each G418-selected population expressed hpc (Figure 3), 
To eliminate the possibility that the factor-independent 
populations represented secondary mutations that con- 
ferred a selective advantage, we perfomied colony assays 
on FDC-Pl cells, immediately after infection, in the 
presence or absence of mGM-CSF. The results (Table 1) 
show that between 28 and 78% of clonogenic cells infected 
with viruses encoding V449E, I374N or FIA were factor- 
independent; this proportion is far too high to be due io 
secondary muutions in a subset of infected cells. When 
these same G418-resistanl populations were subsequently 
selected for factor independence, the proliferation observed 
in the absence of factor was comparable with thai of 
uninfected FDC-Pl cells in the presence of mCM-CSF 
(Figure 4B). The presence of the appropriate hjk cDNA 
constructs in the factor-independent FDC-Pl cells was 
further confirmed by recovery of the entire hpc fragment 
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Fig. 4. Prohfcraiion of FDC-PI cells infected wiih hpc mutants in the 
presence and absence of murine GM-CSF (A) Proliferation of cells 
selected and maintained prior to assay in mGM-CSF plus (except for 
the uninfected control cells) 04 1 8. Proliferation assays were ca^d 
out. as described in Materials and methods, in the presence o^ibscncc 
of mGM-CSF for uninfected FDC-PI cells and cells infected with 
wild-iype h(k (wi) or the indicated hpc mutants. (B) Proliferation of 
FDC-Pl cells infected with activated h|k mutants that were selected 
pnor to assay by growth in factor-free medium. Proliferation of 
unmfccicd FDC-Pl cells in the presence of mGM-CSF is shown for 
comparison. 

by PGR from genomic DNA, followed by restriction 
enzyme digestions diagnostic of each mutant (Figure 
7B). We therefore concluded that the V449E and I374N 
substitutions could confer factor independence on FDC- 
Pl cells and thus could constitutively activate hPc. Note 
that the latter mutation results in the loss of a BstW 
restriction site (as shown in Figure 7); by digesting with 
Bj/Yl, we subsequently found that ail nine of the factor- 
independent clones that lacked the V449E mutation carried 
the I374N mutation (data not. shown). 

Biological activity of hfic mutants ejcpressed in 
other cell types 

While the FDC-Pl cells used in the preceding studies do 
not express human a subunits, they do express murine 
GMRa and IL-3Ra subunits. Because signalling by normal 
GM-CSF, IL-3 and IL-5 receptors requires formation of 
a complex of a subunits, P subunits and ligand, we wished 
to investigate whether the ligand-independeni mutants 
could function in the absence of any a subunits. For these 
studies, we chose the murine IL-2-depehdent T-cell line 
CTLL-2 (Ceroiiini et a/., 1974) because it does not express 
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Table 1. Frequency of facior independence foltowing infeciion of (-DC-PI cells with hpc 


inui;in(s* 




FDC PI cells infected with**: 


Number of colonics*^ 








mCM-CSF 


mGMCSr + G4IK 




Perec ni fuctor- independent 


No vims 


550 


0 


0 


0 


Wild-type 


689 


56: 


f) 


0 


I374N 


936 


736 


573 


77.9 


V449E 


886 


565 


157 


27.8 


RA 


587 


491 


305 


62.1 



K:ell$ were washed and plated in agar-containing medium, as described in Materials and methods, immediately after co-cultivation with H'2 cells 

*Vini$-producing cells were sorted by flow cyionficiry for those expressing h(Jc prior to use in this cxpcrimcni. 

^Average number of colonies present on duplicate agar plates seeded with 10^ cells. 

talculaicd as the percentage of infected, i.e. C4l8-resistant. colonics that were factor- independent. 



any endogenous GM-CSF/1L-3/IL-5 receptor chains but 
has been shown to proliferate in the presence of human 
GM-CSF following introduction of both receptor subuniis 
(Kiiamura et o/., 1991b). 

Although infection of CTLL-2 cells by the murine 
retroviruses used was very inefficient (probably ^ 0.01% 
as estimated by G4I8 resistance; unpublished observa- 
tions) we were able to isolate CTLL-2 infectanis, 
expressing both wild-type and mutant (I374N, V449E and 
FIA) forms of hpc, by selecting for 04 1 8 resistance and 
maintaining the cells in the presence of IL-2. Moreover 
flow cytometric analysis (Figure 5A) showed that a sub- 
stantial proportion of these cells clearly exhibited cell 
surface expression of p subunits. Somewhat to our surprise, 
however, the resulting cell lines failed to proliferate in 
the absence of IL-2 (Figure 6A). Because of the low 
frequency of infection, and the consequent possibility that 
the CTLL-2 populations may have comprised only one or 
a few clones, we wondered whether they were expressing 
aberrant, i.e. spontaneously mutated, fonms of the original 
constructs. However, recovery of the hpc cDNAs from 
these cells by PGR generated products of the expected 
size and which exhibited the expected restriction enzyme 
digestion patterns (Figure 7C). 

One possible explanation for this result was that the 
activated hPc mutants function by interacting with an a 
subunit (sec also Discussion) and thus were unable to 
function in CTLL-2 cells lacking a subuniis, but could 
interact with the GMRa or IL-3Ra subunits in the FDC- 
PI cells. To test this possibility, and also to confirm that 
the CTLL-2 lines we derived could in fact respond to 
normal GMR-generated signals, we superinfecied CTLL-2 
cells expressing either wild-type or mutant p subunits with 
a vector carrying the hGMRa subunit and a puromycin 
resistance gene (pRUFPuro-hGMRct). We reasoned that 
if the mutant P subunits interacted with mouse a subunits, 
then it would be likely that they would also interact with 
human a subunits. 

We were able to derive puromycin-resisunt populations 
from each P subunit-cxpressing CTLL-2 line (i.e. wild- 
type, I374N, V449E and FlA); as only a proportion of 
these (12.3-33.2%) expressed the a subunit (data not 
shown), the latter were purified by preparative flow 
cytometry. Rc-analysis of the sorted cells showed that 
they expressed both subuniis of the human OMR (Figure 
5B). Nevertheless, none of the resulting populations could 
proliferate in the absence of IL-2 (Figure 6B), indicating 
that the presence of the hGMRa subunit did not enable 
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Fig. 5. Flow cyiomciric analysis of h^c and hGMRa expression on 
infected CTLL-2 cells. Nomenclature and axes are as in Figure 3. 

Doited lines ( ) represent cells stained with an irrelevant control 

antibody, solid lines (_) indicate staining with ami-hPc antibody and 

dashed lines ( ) indicate staining with anti-hGMRa antibody. 

(A) CTLL-2 cells infected with retrovirxjses encoding the h^c mutants 
indicated on each histogram were stained with an ami-h^c monoclonal 
antibody as described in Materials and methods. (B) CTU--2 cells 
infected with retroviruses encoding the h^c muumis indicated on each 
histogram were superinfecied with a retrovinis encoding hGMRa and 
stained with anli-hPc and anii-hGMRa monoclonal antibodies. For 
comparison, analyses of uninfected CTIX.2 cells and cells infected 
only with the hGMRa virus (a) are also shown. - 



the constitutive generation of proliferative signals by either 
h^c mutant. 

The possibility remained, though, that despite the PCR 
analysis (Figure 7C) there were other defects in the P* 
subunits expressed by the CTLL-2 cells or that our CrLL-2 
cells could not respond to GMR-generated signals. We 
thereTore transferred CTLL-2 cells co-expressing a and p 
subuniu from medium containing IL-2 to medium con- 
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Fig. 6. Proliferation of CTLL-2 cells infected with h(k mutants in the 
presence and absence of human GM-CSF or mouse IL-2. 
(A) Proliferation of cells selected and maintained prior to assay in 
IL.2 plus (except for the uninfected control cells) 04 18. Proliferation 
assays were carried out, as described in Materials and methods, in the 
presence or absence of IL-2 for uninfected CTLL.2 cells (uninO, cefls 
infected with the vcrtor alone (pRUFNeo). wild-type hPc (wt) or the 
indicated h^c mutants. (B) Proliferation of CTLL-2 cells infected with 
activated h^c mutants and supcrinfected with a retrovirus encoding 
hCMRa Proliferation assays were carried out in the presence of 
mIL-2, hGM-CSF or in the absence of either factor, as indicated 
Proliferation of uninfected CTLL-2 cells (uninO and cells infected 
only with the hCMRa (a) virus is shown for comparison. 

taining hGM-CSF Cells expressing ihe wild-type hpc, 
and also those expressing the mutant P subunits, continued 
to grow in hGM-CSF, as shown by proliferation assays 
(Figure 6B); in fact, growth could be maintained in as 
little as I ng/m! hGM-CSF (data not shown) which would 
be expected to stimulate dnlx, high-affinity receptors. This 
indicates that the mutant receptors could still interact in 
an apparently normal manner with ligand and a subunits. 

In view of these results, we wondered whether the 
ability of the mutants to confer factor independence was 
restricted to FDC-Pl celk. Retroviruses containing the 
mutant and wild-type forms of hpc were therefore used 
to mfccl the IL-3-dependent BAF-B03 subline of the 
pro-B cell line. Ba/F3. Figure 8A shows ihau as before 
wild-typc hpc and each mutani 
(y449E, I374N and FIA) could be readily obtained. 
However, only y449E could confer factor independence 
on these cells, as shown by proliferation assays (Figure 
8B) and prolonged monitoring of liquid cultures in factor- 
free medium (data not shown). As seen with the CTLL-2 
cells, superinfection f BAF-P03 cells expressing I374N 
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Fi^. 7. Analysis of proviral hpc sequences in infected cells by PGR 
and ftsiriction enzyme digestion. (A) Map of hpc cDNA showing 
Nco\ (N). BsCiX (Bs) and BghX (Bg) restriction sites used to 
authenticate each form of hPc. as well as the region duplicated in FIA 
(indicated by boxes). The restriction sites affected by the point 
mutations are indicated as Bg* (gained in V449E) and Bs" (lost in 
1374N). Arrows indicate the positions of PCR primen used to amplify 
hPc fragmejis from genomic DNA, (B) Electrophoretic analysis of 
PCR products generated from genomic DNA of FDC-Pl cells infected 
with the indicated hPc mutants; controls are reactions conuining no 
DNA (-) or DNA from cells infected with the vector alone (pRN). 
Unes M contain DNA size standards (SPPO phage DNA digested 
with £coRI (Bresatec Ltd, Adelaide. South Australia)!. PCR products 
were either undigested (lanes I), digested with 5^/11 (lanes 2), flx/YI 
(lanes 3) or A^col plus Bg(i\ (lanes 4). Bands in each digest (hat differ 
between the mutants and the wild-type are indicated by asterisks in the 
lanes containing the DNA from the appropriate mutant-infected cells 
(C) Electrophoretic analysis, as in (B), of PCR products generated 
from genomic DNA of CTLL-2 cells infected with the indicated hBc 
mutants. 



and HA with the hGMRa subunit failed to confer factor 
independence (data not shown). Nevertheless, hGMRa 
subunit expression did allow these cells to proliferate in 
1 ng/ml hGM-CSF (data not shown). 

Tumorigenicity of hpc point mutants 

The ability to confer factor independence on haemopoictic 
cell lines is generally indicative of tumorigenic potential 
(see Discussion). To test whether the V449E and I374N 
mutations were capable of conferring tumorigenicity, we 
injected factor-independent FDC-Pl cells expressing each 
of the mutant p subunits into syngeneic mice. Uninfected 
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Fig. 8. Analysis of BAF-R)3 cells infected with retroviruses encoding 
i»ciivaicd forms of h^c. (A) Flow cytometric analysis of h(k 
expression. Procedures and nomenclature are as for Figure 3. except 
lhat the panel labelled *V449E(FI)* shows staining of BAF-F03 cells 
infected with the V449E mutant and selected for factor-independent 
growth. (B) Proliferation of the BAF-P03 cells depicted in (A) in the 
presence and absence of mouse ILO. 



FDC-Pl cells and FDC-Pl cells expressing wild-type hpc 
were injected as controls. All four mice injected with 
V449E, I374N and, as expected (D* Andrea et ai, 1994). 
FIA-cxprcssing cells devjjdped tumours at the site of 
injection within 3 weeks, while none of the control mice 
had developed tumours by 9 weeks post-inoculation. 

Discussion 

Oncogenic potential of the GM-CSF receptor 
In this report, .we have shown lhat hpc can be activated 
by point mutations leading to single amino acid substitu- 
tions. To date, the only other member of the cytokine 
receptor family shown to be activated by point mutation 
is the erythropoietin receptor, in this case activation 
resulted /rom the substitution of cysteine residues in the 
extracclluar domain (Yoshimura et aL, 1990; Watowich 
ei aL, 1994). It seems quite likely that there may also be 
other point mutations that can activate h^c. For example, 
the studies described here have targeted only about one- 
third of the hpc cDNA, so there may well be potential 
activating mutations in other pans of the molecule and/or 
in associated a subunits. 

These findings, and the previous report of a structural 
rcanangcmcnt that leads to activation of h^c (i.e. the FIA 
mutant; D* Andrea ei a/., 1994), raise the possibility that 
activating mutations in hpc may c niribute to human 



malignancies— most likely, myeloid leukaemias. Lesions 
that confer factor independence on FDC-Pl and similar 
factor-dcpcndcni cell lines, such as autocrine GM-CSF 
production (Lang ct al.. 1985), expression of activated ! 
abl (Cook et ai. 1985) and fms (Wheeler et aL, 1987) 'i 
genes, usually lead to the acquisition of tumorigcnicity. < 
This is also true for the activating mutations in hPc, as 
described here and by D' Andrea et ai ( 1 994). Furthermore, 
many of these same lesions have also been implicated in - ^ 
myeloid leukaemias (e.g. Shtivelman et ai, 1985; Young " 
and Griffin, 1986; Ridge et al., 1990, respectively). The ' 
only survey (that we are aware oO which has examined J 
the hPc gene in human leukaemias revealed no evidence - 
for rearrangements (Brown et ai. 1993). However, that : 
study would not have detected point mutations, as it 
was performed by Southern analysis of genomic DNA. i 
Extending our present approach to provide a compre- 
hensive *map* of potential activating mutations in hPc 
will provide a rational basis for selecting, for finer analysis, 
particular regions of the hpc gene in leukaemias. 

implications for the mechanism of action of 
normal and mutant GM-CSF receptors 

Understanding the mechanism by which the mutations 
result in receptor activation will be intimately connected 
to understanding how the normal OMR is triggered by 
GM-CSF. Both of the main findings of this study — 
the nature of the activating mutations and the differing 
activities of the mutants in a range of cell types— bear on 
this issue. The differing abilities of the mutants to confer 
factor independence on BAF-FOS cells indicate that they 
exert their effects by distinct mechanisms, which is consist- 
ent with the positions of the muutions within hPc. 
Furthermore, our results imply that the mutants, and, most 
likely, the normal receptors, interact with cell-type specific 
molecules. It is difficult to avoid the conclusion that there 
are (at least) two such molecules present in FEXT-PI cells 
that are involved: one, which is required for signalling by 
the V449E mutant, is also present in BAF-F03 cells but 
is absent from CTLL-2 cells; and a second which is 
required for signalling by the I374N and FIA mutants but 
which is absent from both BAF-F03 and CTLL-2 cells. It 
is unlikely that either of these components are a subunits, 
as co-expressing the human GMRa subunit (Figure 6) or 
either the human or murine IL-3Ra subuniu (data not 
shown) with the hpc mutants in CTLL-2 cells dfd not 
result in factor-independent growth. Similarly, introduction 
of the human a subunits failed to allow BAF-F03 cells 
expressing the 1374N or RA mutants to proliferate in the 
absence of factor (data not shown). 

Mechanism of activation by the V449E mutatiqp. At the 
moment, the transmembrane domain mutation' V449E is 
probably the simplest to interpret since it is analogous to 
the activating mutation (V664E) found in the rat neu 
oncogene (Bargmann et a/., 1986). Several reports have 
indicated that the neu mutation probably acts by inducing 
constitutive receptor dimerization (Sternberg and Gullick, 
1989; Weiner et a/., 1989). Since dimerizaUon is believed 
to be the key step in ligand-induced signalling by all 
receptor tyrosine kinases (reviewed by Ullrich and 
Schlessingcr, 1990), the mutation has essentially the same 
effect as ligand binding. By analogy, we propose that the 
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V449E mutation acts by inducing ligand-independcnt 
dimerizalion of the P subunil and initiating the generation 
of intracellular signals. 

The exact basis for dimerization induced by'hydrophilic 
substitutions in the transmembrane domain is unclear, 
even though models have been proposed for Neu that 
invoke particular interactions between residues in the 
mutant transmembrane domains (Sternberg and Gullick, 

1989) or conformational alterations (Brandl-Rauf et qL 

1990) . Our mode> for V^9E is based on the striking 
similarity to activating mutations in Neu. However, the 
sequence surrounding the mutated valine in hpc does not 
conform to the ^rules' worked out for Neu (Cao et vL, 
1992), although both h|k and Neu have an alanine four 
residues N-terminal to the substituted valine. Moreover, 
not all receptors can be activated by such mutations. Thus 
it is very difficult to speculate as to why the V449Q 
and V449D mutations do not activate h(k; structural or 
modelling data will be necessary to provide further insight 
into the requirements for activation. 

The notion that p subunit dimerizalion can trigger 
intracellular signalling is supported by a number of 
observations peruining to murine or human p .subuniis 
and to other members of the cytokine receptor family. 
Several members of this family (e.g. the receptors for 
G-CSF, Epo and human growth hormone) clearly function 
as homodimers; in these ca.scs ligand binding induces 
dimerization which, in turn, results in the generation of 
proliferative signals (Ftikunaga ef aL 1990; de Vos et al, 
1992; Watowich et aL, 1994). A second important case is 
that of the IL-6 receptor which, like the GMR, is hetero- 
meric (Taga et aL 1989) and comprises a ligand binding 
a subunit plus the signalling subunit, gpl30 (which can 
be viewed as the equivalent of the GMR p subunil). 
Activation of the IL-6 receptor by ligand results in the 
formation of a heteromeric complex that contains a gpl30 
dimer (Murakami et aL, 1993) along with two IL-6 
receptor a subunits (Ward et aL, 1994). With regard to 
GMR/IL-3R/IL-5R P subunits, it has been shown that a 
chimera containing the extracellular region oftthe Epo 
receptor and the intracellular domain of the murine P,l.3 
(AIC2A) subunit can mediate Epo-dependent proliferation 
(at least in Ba/F3 cells; Sakamaki et aL, 1993). In addition, 
other experiments have shown that chimeras comprising 
the extracellular region of the GMRa subunil (K.Arai, 
personal communication) or the IL-5Ra subunil (Takaki 
et aL, 1994) and the intracellular region of hpc can 
transduce proliferative signals in the presence of normal 
hpc and the appropriate ligand. These data imply that 
dimerization of the intracellular portion of the P subunit 
is sufHcienl to initiate cellular proliferation. 

One model that could explain the inability of the 
V449E mutant (see Figure 6) and the EpoR-p,L.3 chimera 
(Sakamaki et aL, 1993) to function in CTLL-2 cells is 
illustrated in Figure 9A. The model assumes that the 
normal receptor complex (or at least one f rm thereof— • 
sec below) comprises two a and two p subunits, and 
proposes: (i) that there is a factor present in, for example, 
FDC-PI cells (factor T') which must be associated with 
hpc homodimers in order to generate proliferative signals 
but which is absent from CTLL-2 cells; and (ii) that 
association f a subunit plus ligand can compensate for 
the lack of factor Y by bringing into the complex another 



factor ('X\ which is present in CTLL-?. cells and probably 
in other cells also) which overiaps in function with factor 
Y. Thus, the dimerized P subunit could deliver proliferative 
signals in cells such as FDC-PI and BAF-F03 where Y 
is present, but could only signal in CTLL-2 cells following 
ligand-induced as.sociation with an a subunit. 

Possible candidates for molecules X and Y might be 
non-receptor tyrosine kinases such as members of the srv 
family, as the IL-3R and the GMR have been reported to 
associate with and/or activate the" products of the fes 
(Hanazono et aL, 1993) and the 5/r-related lyn (Torigoe 
et aL, 1992) genes. While the JAK.2 kinase is also 
associated with the IL-3R (Silvennoinen et aL, 1993), it 
is unlikely that X or Y is JAK-2 since the latter is also 
present in CTLL-2 cells (Mano et aL, 1993); it is more 
likely that JAK kinases are required in addition to X or 
Y. Alternatively, X and Y may be adaptor molecules that, 
by analogy with GRB2 (Lowenstein et aL, 1992) for 
example, bring other components of the signal transduction 
pathway into the receptor complex. A third possibility is 
that they arc members of the STAT family of transcription 
factors thai associate with many members of the cytokine 
receptor family (reviewed by Darnell et aL, 1994). 

Mechanism of activation by the l374Nand Flii mutations. 
For reasons discussed above— in particular their differing 
abilities to function in BAF-F03 cells— these mutations 
are unlikely to act in the same way as V449E, i.e. by 
fonnation of hpc homodimers. Moreover, as both are 
within the extracellular portion of hPc, it appears likely that 
they may function by interacting with another membrane- 
spanning molecule, "uch a situation would be superficially 
reminiscent of the case of the IL-2 receptor which has a 
third component, the y subunit (Takeshita et aL, 1992). 
However, the analogy is not strictly appropriate because 
the IL-2R Y subunit appears to be essential for signalling 
(reviewed by Taniguchi and Minami, 1993). One could, 
however, postulate that a myeloid-specific 'y subunit' 
could associate with an aP dimer to provide a function 
which overiaps that of a second p subunil. Functional 
receptors could then have one of two compositions— 
(aP)2 or aPy. Thus the I374N and FIA mutants may form 
Py complexes constiiutively, i.e. in the absence of a 
subunits or ligand, and generate proliferative signals in 
myeloid (e.g. FDC-PI) cells but not in CTLL-2 or 
BAF-F03 cells, which represent the T- and B-cell lineages, 
respectively. Consistent with this, preliminary studies 
have indicated that FIA (and V449E) can induce factor- 
independent colony formation by myeloid progenitors 
from murine fetal liver (unpublished observations of 
M.McCormack and the authors). This model is illustrated 
in Figure 9B. 

It is not clear how the I374N and FIA muutions 
could induce constitutive association with an additional, 
membrane-spanning receptor component such as a putative 
Ysubiinit. However, one plausible explanation comes from 
examining the structure of the regions involved. Sequence 
alignment of the extracellular regions of hPc and hGHR 
(Bazan, 1990; Goodall et aL, 1993) suggests that 11^374 
lies in the P strand C of the membrane-proximal domain 
of hPc. Furthermore, predictions based on the structure of 
hGHR (de Vos et aL, 1992; CJ.Bagley and A.Lopcz. 
pers nal communication). suggest that the side chain of 
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2^p£ StSni ?^ r .ni!! T '^'"P*"^ ''^ « Prolif/rativc signalling 

^cxisien^ra iWrf "f^^ cxp anat.on. (B) A model pertaining lo ihc D74N and FIA mutants which invokes 

TSt nf } ^ 'P^*'**^ "The model postiflates that (he y subunit can associate with 

aisocitlKlh thTv wbu\^ 1^ cells: the I374N and HA mutations would then function by allowing constitutive 

^^Tp^l P ^""""'^ ^o'^Pl"" in CTLL.2 cells when both a subunit 



Ilc374 may contact the hydrophobic side chains of valine 
residues in the conserved RVRVR sequence within the F 
strand, of hPc. Intriguingly, the F strand is part of the 
region duplicated in the FIA mutant; thus, both the 
duplication in the FIA mutant and the hydrophilic substitu- 
tion in I374N may be affecting a common structure or 
surface thai includes strands F and C— and which, we 
suggest, may be involved in interacting with the putative 
YsubuniL A recent report by Grubc and Cochrane (1994) 
supports the notion that strand C may play a regulatory 
role in interactions with other receptor subunits. as it was 
found tiiat a peptide corresponding to strand C of the IL-6 
receptor (a subunit) can inhibit signalling by IL-6 without 
affecting ligand binding. We therefore suggest, in a similar 
vein to D' Andrea et ai (1994), that association with 
ligand plus a subunit, a disruptive amino acid substitution 
such as I374N in the p strand C or duplication of strand 
F may all act to unmask an interactive surface involving 
strand F, allowing association f hpc with the putative y 
subunit and triggering the generation of intracellular 
signals. 
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Both of the models illustrated in Figure 9 require a 
degree of functional redundancy between components of 
the OMR (i.e. between X and Y, and between tiie p and 
'y' subunits). This redundancy may not be complete, 
however, because the studies presented here only address 
proliferation in established cell lines; it is possible that 
the 'redundant' components may vary in their abilities to 
generate, say, differentiative signals. 

We also wish to point out tiie potentially general utility 
of the methodology used to isolate the mutants described 
here. While a retroviral library of random mutants has 
already been used by Druker and Roberts (1991), the PCR 
mutagenesis approach used here is much simpler than the 
chemical mutagenesis employed by tiiese authors. Because 
many receptors, including members of both receptor 
tyrosine kinase and cytokine receptor families, can induce 
ligand-dependent proliferation of factor-dependent haemo- 
poietic cell lines (e.g. Di Rare et aL 1990; D' Andrea 
et aL 1991; Sakamaki et a/.. 1993; Ziegler et aL 1993), 
the procedure described here offers tiie possibility of 
identifying activating muutions in any of these molecules. 



Furthermore, this option may als apply to Intracellular 
molecules that are involved in receptor-mediated 
signalling. Finally, it should be* possible to extend this 
approach to ther types of screens using different cell 
types, including primary cells, and t devise screens 
for loss-of-function mutants as well as gain-of-function 
muunts. This could facilitate the identification of func- 
tional domains and structures in molecules involved in 
other cellular functions such as differentiation, adhesion 
or gene regulation. 



Con»tituthf« MtWation of h^c by point mutation 



Tible II. Effcci of Mg2* and Mn^* conceniniiionit on PCR 
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(MnCI;| mM iMgOij mM Bases sequenced Muuiion i 



0 

0.1 
0,25 
0.5 
0 



7.0 
7.0 
7.0 
7.0 
1.5 



2520 
>900 
3060 
2S00 
2200 



1/840 

1/510 
t/400 
1/2200 



Materials and methods 

Celt fines snd cDNAm 

PA.1I7 (Miller and Butiimore. 1986) and ^2 (Mann et al., 1983) 
retfovinjs packaging cell lines were mainuined in DME medium 
supplemented with 10* fetal calf serum (PCS). The lL-3/GM-CSF- 
f^T^^l"^"^ '"y*'**'** ^Dexter H ai, 1980). and 

ihc BAF.B03 subline (Haiakeyama et aL 1989) of the IL-3^ndcni 

'^ilT^^ ^ «m»e^ium. 

as above, cooiaining 80 units/ml mouse CM^SF (produced by an 
enginected «nin and kindly supplied by Dr Tracy Wilson. Walter 
^Jr""" Melbourne) or m units/ml mouse ILO 

(produced rrom a baculovirus vector and kindly supplied by Dr Andrew 

K ^tl^'^at ^^^^ CirberTi).',espect7vS^ 

The IL.2-depcridem mouse T^ll line. 00X^2 (Ceroctini et 

"^^Hi^T^ RPMI.I640 medium supplemented with 10% FCS 

and 50 MM 2-mcrcaptoethanol. in the presence of 100 units/m! bacterially 

syntteuDed mouse IL.2 (expressed from the plasmid pTOCH which 

was kindbr provided by Dr CZurawski. DNAX, PaJo Alto) or m 

conditioned medium from MLA cells (CTLL medium) 

•n»e hPc cDNA (Hayashida « a/.. 1990) used here wzt that described 

by Barry et ai (1994). cDNA for hCMRa (Gearing et oL 1989) was 

ktnd^ provided by Dr Nic Nicola (Waller and Eliza Hall Institute 

Melbourwr. Australia). The activated RA mutant of hSr has been 

dcscnbcd previously (D* Andrea et aL 1994). 

pfasm^^ '"^^^ff^ff ^» snd constrvetion of expnssion 

SiteKiirected muiagenesU was carried out on single-strandcd DNA with 
mutagenic oligonucleotides using the pAlier-l system (fWga) in 
^I^J^'^J^ manufacturer's instruction,. All mutations were 

we« blun nend l.gated between end-filled Bglll and Xhol restriaion sites 
oTlhc pRUFNco retroviral vector. The pRUFNeo retroviral vector was 
modified from ihe version tJescribed by Rayner and Gonda (1994) by 
restriction site from the miWtipte cloning site. 
, . PRy™'«> retroviral expression vector was constructed by 
replacing ihetMCINeo cassette of pRUFNeo with an SV40/puromycin 
. resistance cassdte from pBabePuro (Morgenstem and Land 1990) A 

cSSlA imn'^rS"? I^^*?** constnicted by inserting the hCMRa 
cDNA into the Xho\ site of pRUFPuro. 

To faciliuie cloning of hpc PCR fragments (see below), a silent 

t™T^ h?mT*k! "^J"' "'^ ^ '^^"^^ wS. 

type hpc cDNA by changing nucleotide 772 (Hayashida et at 1990* 
«^nce aacssion number M38275) from C to C. Tliis cDNA wai 
then inserted into pRUFNeo as above to generate pRUFNeo-hpc. ^ 

^iAflb^'^ coiifimrt/a/i of « polnt^utated hfie 

R«Kk>m point mutations were introduced into a 940 bp XHoI-BbIII 
^cgmeni, bases 770-1710 (Hayashida et «/.. 1990) of the hpc cDNA. by 
^"^"^ n**^ d«cribed by Cadwell and Joyce 
xjT: ■ ■ „ . f?"^ ****** amptification were S'-TOOAGCCCAG- 
^SSTTTSS^^^-^' (nucleotides Tl^737) «^d 5'<5GGCCC^ 

l^^^'^^^^^ilPS^'^' I804-|780)?SS; 
109 bp fngnwit Standard and muugenic reactions (in a volbme of 
50 jil) were performed on 5 ng of pRUFNeo-hJc plasmid DNA using 
12 fiM of each primer, and were cycled in a MiniCyckr (MJ Reseaidi) 
for 30 cycle, of 94-C for I min. 68-C for I min ^ 72-C for^ 
Reactions were denatured at 95*C for 5 min prior to cyclihg. StandanI 
" equimolar concent^tion of dfTTPs (0.2 mM). 
1 .3 mM MgO, and 2 J units of Tag polymerue (Perfcin Elmer), whereas 



reactions contained 0.2 mM dATP and dCTP. I mM dCTP 
and dTTP. 7 mM MgCh. 5 units of Tag polymerase and v^ 
conceniraiions of MnOj. Toestimaie mutation frequencies. PCR products 
"^"^ wbcloned into appropriate pCEM 
vectors (Promcga) and sequenced; the results are shown in Table II No 
strong mutational bias was observed umk. these reaction conditions, 
although tl«re was a weak bias towards T to N and N to A substitutions 
similar to that reponcd by Cadwell and Joyce (1^2). 

A library of h^c fragments conuining mutations at the desired 
frequency (0.2%) was regenerated by performing the mutagenic PCR as 
S^TLIU ^nr'^'^ tn '^J^ *^''^'- '^o^o^ing digestion with 
li«i ' '"r"- gel-purified and 

hgated ,l,rect.onally into the pRUFNeo-hfJc from which tf« Xhol- 
Bgm fragment of hpc had been excised. Following transformation of 
r^^fr™*^^^^^ the result«.t hpc cDNA 1^ 

'n^n^t, '•^^.'^\*«^P«'»^"» P»«mid clones was fu^ 

amplified as described previously (Rayner and <3onda. 1994). 

FK^PI^^Ih ^^^^ f^ctor^ndepondent 

* retroviruses as 

descnbed by Rayner and Gonda (1994). Briefly. PA3I7 amphotropic 
packaging cells were inmsfccted with pRUFNeo-hpc"" plasmid DNA 
and virus^ontaming transient supematanis were filtered and used to 
infMt ^2 ccotropic packaging cells. Infected ^2 cells were selected in 
medium containing 400 Mg/ml 0418. pooled, and used to infect FDC- 
co<u!tivaiing 5X|05 FDC-PI cells with 
10* irradiated (30 Gy) y2 producer cells for 48 h in each of fouVS^^ 
flasks. -n»e FDC.PI cells ftpm each- flask were harvested, washed 
and selected for factor-indepcndem growth in liquid cuhure medium 
containing 0418 (I mg/ml) without GM<hjF. Plating in soft agar 
was performed essentiajly as described by Johnson (1980); GM^TSF 
(80 units/ml) or C418 (I mg/ml) was added as required. 

Rescue of mutant hfie cDNAa from factor^ndapandent ceih 
Genomic DNA was isolated from cells using a Proteinase K/SDS 
procedure essentially as described by Hughes et ai (1979) PCR was 
performed on 100 ng of genomic DNA with P/w DNA poly^rase 
(Stratagenc) under conditions recommended by the manufacturer The 
pnmers and cycling parameters used were as described above PCR 
producu were inserted into pGEM vectors (Promega), and sequenced. 

Characterizathn of activating hfie point mutations 
Point mutmions generated in h^c by sitenlirectcd muugcnesis were 
^i^r !? '?."''^ ^"^"^ irxtependence by expression in 
"^^^'^ ^ ifwsfecied by 5* calcium 

phc«phate procedure (u described by Rayner and Qonda. 1994) and 

^T^^'H^nlT^.'^^^ <^ ~<"Wva. 
Z^i^J j^'^!^^ ^ ^ co<ultivation and 

ItS? ^^J^^'^-'^fV^x growth as before. For Cnx.2 infections. 
4X 0* ttably transfected 4^ cell, were ir«diaied (30 Oy) and co^ 
cultivated with 10^ Cru.2 cells in CTUL medium (see rt«^e) 

47 aU, and selected in CTLL medium containing 0418 at 750 ng/ml. 

^n iS^ZLr"* '^'^^ were washed and incUSted 
in medium with or wiihoul ftctor. 

WA «d «ubl* t«mf«cums sekcud with punmiycin «t 2 (igAnl. • 
Siir^ V*^2L^ muu«i were infeeied by wxuUivwiSi as 
before iiMj «le«ed in CTTX nwdium comiining puiomycin « 2 Mg/ml 
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Ml Borting snd BnMlytiM f rectptor subunh expressi n by 

^ nowcytonftry 
Selected cells expressing hGMRa were collected by cell sorting on a 
FACSuf'*^* flow cytometer (Becton-Dtckinson). Briefly, cells were 

- washed ind resuspended in cold RPMI-1640 medium supplemented with 
5% FCS (RPMI-FCS). Cells were incubated with the anii-hGMRa 
monoclonal antibody 8G6 (Q.Sun and A.Lopez, manuscript in prepara- 
tion) for 20 min on ice. washed and subsequently incubated with RTC- 
conjugated anti-mouse Igg (Silenus) for 20 min on ice. After washing 
and resuspcnsion in cold RPMI-f=CS. (he cells were sorfcd. and (he 
positive cell population collected in CTLL medium. 

Expression of \fic mutants on the cell surface of infected FDC-PI. 
BAF-P03 or CTtL-2 cells was confirmed by flow cytometric analysis 
using an Epics-Profile 11 analyser (Coulter) by staining, as described 
above, with the anti-hPc flionocl9nal antibody 3D7 (Q.Suir and A.Lopez, 
manuscript in preparation). hGMRa expression was analysed as above 
by staining with the monoclonal antibody 8G6. 

Ml proliferation asssyn 

Infected cells (FDC-Pl. BAF-P03 or CTLL-2) were washed twice and 
triplicate samples of equal cell number (5x 10^) were cuhuied in a 96- 
well mtcrotitre plate with or without appropriate growth factor for 72 h. 
Cell proliferation was nr>easuped by the Ceimicr'% Non-Radioactive 
Cell Proliferation Assay (Promega). 

TumorigonidtY studies 

FDC-Pl cells (5X 10*) expressing each of the muunt P subunits (V449E. 
I374N or FIA) were injected subcuianeously into each of four 10- week- 
old nule DBA2 mice; as controls, mice were injected with cells 
expressing wild-type hPc or uninfected FDC-PI cells. Tumour formation 
was monitored at weekly intervals. 
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Note added in proof 

We note a recent publication describing the activation of the mouse Be 
subunit by an amino-terminal truncation [J.Hanncman. THara M Kawai 
A.Miyajima, W.Gsieriag and CStocking (1995) Sequentiar mutations 
in the inierleukin.3 (IL-3ygranulocyic.macrophage<olony-stimulatin2 
facior/IL.5 receptor ^subunit genes are necessary for the complete 
conversion to growth autonomy mediated by a truncated Be subunit 
Mol. Celt Biol., 15. 2402-24121. 
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Construction of retrovirus expressing kBc Th*- mncf« ^- 
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Rg 1. Generation and characterization of factor-independent celli. (A) Diagrammatic representation of the h^e retroviral construct The 
vector used b pRUFmlNoo." Only relevant restriction sites are shown. (B) Southern analysts of human tL-3-dependent celb (FOOPI cells 
expressing hlL-3Ra and hfic) and the factor-independent hfie FDC-P1 ceR One. Filters were probed wHh an l^c coding fragmmt The wild- 
type h/Sc fragments are 4.4 kb in tiie Xba I digest and 3.4 Idi and 1.0 kb bi the Xba VBgl D digest Clearfy visible is a larger hfic fragment In 
the factor-independent FDC-P1 cells. (Q PCR from genomic DMA with Internal hfic primers. The product generated spans tha transmembrane 
domain and includes a small segment of cytoplasmic sequence. Une a. markers. ^X174 Hoe HI (Pharmada, P-t Biochemlcals Inc. KWwaukae, 
Wl). Lane b, PCR using genomic DMA from factor-independent FDC-P1 cells as template. Une c, PGR using genomic DMA from human 0.-3- 
dependent FDC-P1 celb (as In [B]) as template. PCR producU spanning the N-terminal and C-terminal regions of h^c were Identical In both 
samples (data not stK>wn). 



of cells (4 X lO') were incubated in the presence of ('H] thymidine 
and DNA synthesis measured using a standard protocol described 
previously.*' 

in vitro mutagenesis. For site-directed mutagenesis, the frag- 
ment to be mutagenized was cloned into pALTER (Promega) and 
single-stranded DNA prepared after infection of transformed Esche- 
richia coii with helper phage (R408). DNA was prepared and muta- 



genesis performed with a mutagenic oligonucleotide following the 
protocols provided by the manufacturer. 

RESULTS 

Isolation and characterization of factor-independeni cells. 
In the course f ur studies using FDC-Pl cells infected 
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With an h/?c retrovinis. a cell line arose spontaneously that 
in the absence of mouse GM-CSF and IL-3 (see Materi- 
als and Methods and F.g I A). To dctcnnine possible causes 
for this, wc first examined cell supematants for the presence 
of factors that could support the growth of FDC-PI cells 

Sll^d factor-independent 
ccUs did not support growth of untransfccted ccUs. indicating 
Uiat factor mdcpendencc was not caused by autocrine prl 
durtion of growth factor. Secondly, we tested these cells for 
overexpression of h^c. However, immunostaining with the 



h^c-specific monoclonal antibody. 5F9, indicated that these 
cells were expressing similar levels of h^c to control human 
IL-3-dependent cells (data not shown). We characterized 
these cells further to determine whether there had been any 
alteraUon in the h0c cDNA on infection and generation of 
factor independence. Southern analysis indicated that two 
retroviral integrants were present in a factor-independent cell 
line (Fig IB. HindUl digest). DigesUon with a restriction 
enzyme that excises the proviral DNA (Fig IB, Xba I and 
Xba VBgl II digests) indicated the presence f a larger than 
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Growth factor 



Fig 2. Prolifaration of r«trovir«lly infected FDC-P1 calls niMsured 
by I*H) thymkfim Incorporation.'' (B) Calls sxpressing hfii oontainii^g 
tho dupUcatsd togmsnt and with tho singis amino add attaration 
wa<^typa; (■) origtnaf factor-indapandant calls; (■) calls 
axprassing hfic from a ratrovirus containing tha racovarad PCR frag- 
mant (saa Hg 1C); (■) calls Infactad with tha pRUFiaNao vactor; {■) 
calls Infactad with a wfld-typa h^c ratrovirus (Rg 1A). Prasanca or 
absanca of growth factor (GM-CSF) is indlcatad. 



normal internal fragment in one copy of the provirus. PCR 
amplification of DNA derived from the factor-independent 
cells using internal hfic primers indicated that there was an 
alteration giving rise to a larger internal h^c PCR product 
(Fig IC, lane b), consistent with the Southern analysis. The 
altered PCR product spanned the transmembmnc domain and 
membrane proximal segments of the receptor. Nucleotide 
sequencing of cloned PCR products generated from these 
cells indicated that the larger h^c product contained both a 
single point mutation (resulting in alteration of threonine to 
lysine) and a tandem duplication of 1 1 Ibp; this r^ulted in 
a 37 amino acid duplicated segment in the extracellular por- 
tion of the receptor, proximal to the transmembrane domain 
(Fig ID). We have reconstructed an h0c retrovirus con- 
taining this altered fragmcnL A full-length hfic receptor se- 
quence was reconstructed in the retroviral vector, pRUFnl. 
Neo, using restriction fragments from h^c subclones and the 
0.8-kb PCR fragment (see Fig IC). % ceUs were transfectcd 
with the resultant plasmid, pRUFni^cHA. and the G418- 
resistant producer cells used to introduce retrovirus into 
FDC-Pl cells. Cells infected with this retrovirus were capa- 
ble of growth in the absence of added growth factor. CeUs 
infected with control virus (either vector alone r b^c retro- 
virus) gave no growth in the absence of added growth factor 
(Rg 2). To show that the duplication alone could c nfer 
factor-independent growth, we cloned the reconstructed h^c 
mto pALTER (Promcga) and restored the altered lysine resi- 
due to threonine by site-directed mutagenesis. FDC-Pl cells 
selected f r G418 resistance after infccti n with this retrovi- 
nis (pRUFnl^cFItmo) were again factor-independent, indi- 
cating that the point mutation was n t involved in generating 
factor independence in these cells. CeUs infected with mutant 



and conu-ol retrovirus were also plated in agar with and 
without growth factor. Only cells infected with pRUpNijSc- 
FIA formed G4l8-resistani colonies in agar in the absence 
of added factor (data not shown), consistent with expression 
of the mutant h^c being sufficient to confer factor-indepen- 
dent growth on FDC-Pl cells. Qearly. factor independence 
in the original cell line is not conferred as a result of retrovi- 
ral integration or sequence alteration in another region f 
h^c. We can also rule out a requirement for the normal h^c 
product, derived from the second retroviral insertion, in the 
factor-independent phenotypc of the original cell line. We 
interpret these results as indicating that the alteration in re- 
ceptor sequence has led to constitutive activation of the h;3c 
subunit, 

Tumorigenicity. Although members of the cytokine re- 
ceptor superfamily have not yet been associated clinically 
with oncogenesis, an activated form of the murine erythro- 
poietin receptor (mEpoR) has been reported and shown to 
have lumourigenic potential."^ We therefore tested the lu- 
mourigenic potential of the mutant h^c molecule by injecting 
factor-independent and control FDC-Pl cells into syngeneic 
mice (Table 1). Control cells, infected with the pRUFnlNco 
vector or with a wild-type h;9c retrovirus, were not tumouri- 
genic. However, FDC-PI cells, infected with pRUF)5cFIA 
(a retrovirus expressing the mutant form of h^c), generated 
solid tumors with a short latency in all mice injected This 
result shows the tumourigenic capacity of activated hfic and 
emphasises the oncogenic potential of the large family of 
cytokine receptors. 

DISCUSSION 

We have shown that a small (37 amino acid) duplication 
of exu^cellular hfic sequence is sufficient to confer ligand- 
indcpendenl growth on FDC-Pl cells and leads to tumouri- 
genicity. We believe that the sequence defined by this dupli- 
cation may be important in signalling because the altered 
receptor structure must in some way mimic a ligand-induced 
signalling event Structural predictions suggest that the seg- 



Tsble 1. Tumourioenicfty of Transfocted FDC-Pl Colls 



Calls Inioctod 


Tumor* Observed/No. of Mi 


C0 Injected 


HBBS 


0/3 




FDC Pl (pRUFNeo) 






FDC-Pl (pRUFh^) 


m 




FDC-Pl (pRUFh^lA) 


5/5 





FDC-Pl cells were infected wrth wild-type end nriutant h^ retrovirus 
(indicated in parentheses) and selected for G418 resistance. To assess 
tunr^ourigenicity, cells were injected into syngeneic 0BA2 male mice 
(8 weeks old). Cells (5 x 10« per mouse) were washed in serum-free 
medium and resuspended in Hanks' Balanced Salt Solution (HBBS; 1 
X 10' cells/mU and 0.5 mL was injected into mice subcutaneously at 
a single site. Mice were ottserved for 2 to 3 monUts for signs of 
palpable or visible tumours at the site of injection. Tumourigenic cells 
gave rise to visible masses with a short latency after injection (2 to 
3 weeks). Mice with solid tumors were killed and autopsies were 
performed to assess the extent of tumor formation. Representative 
mice injected with nontumorigentc cells were also killed and post 
mortems were performed to ensure Oiat tumors had not developed. 
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ment duplicated comprises two ^-strands (E' and F') and 
two connecting loops. E'-F' and F'-G'.'" It also includes 
the WSXWS motif and an adjacent conserved basic region 
(consensus YXXXVRXR. see Patthy") corresponding to the 
predicted ^-stnuid F' (see Fig 3A). The following mecha- 
nisms of acuvation are possible: (I) acUvaUon occurring 
through association of mutant hflc with murine receptor a- 
chains expressed on these cells; (2) acUvation through an 
altered tertiary structure that mimics a ligand-induced event- 
and (3) the duplicated segment allows h/Jc dimcr foiroation 
m the absence of ligand and hence plays a direct role in 
acuvauon of the mutant receptor. The duplicated segment 
may form an exposed site mediating interaction of two hfic 
molecules. In the normal receptor, the single copy of this 
site maybe involved in dimer formation, but only in the 
presence of specific a-chain and bound ligand (Fig 3B) 
Induect support for this third model ([3] above) comes from 
rc»ent studies with gpl30. the signalling subunit for IL-6. 
CNTT. and IJF. Based on the fact that receptor complexes 
for n^3. IL-5. and GM-CSF share a signalling subonit. as 
do those for IL-6, Oncostatin M. CNTF. and UF. it has been 
suggested that h^c may fonn a dimer in a similar fashion 
to gpl30.^ The observation that the signalling subunits 
h^c and gpl30. have homology in their intracellular do- 
mains further suggests tiiat there are similarities in the 
mechamsms mediating signalling. Additional support for this 
model comes from a study with a chimeric receptor con- 
taming the extracellular domain of the mEpoR and the intra- 
cellular domam of a murine Iv3c homologue. AIC2A " This 
chimera mediates Epo-dependent proliferation consistent 
with the acavated form of AIC2A being a dimer 

Several studies on other receptors within tlfis superfamily 
suggest the importance of this conserved membrane proxi- 
mal segment m signal transduction. The role of the WSXWS 
motif has been extensively studied, although results are con- 
flicting. Whereas mutations in this sequence have been re- 
ported Jat abolish ligand binding." mutations in the 
W5XWS motif of m^R suggest a role in signalling » A 
comprehensive mutagenesis of the IL-6 receptor a-chain (IL- 
6Ra) Identified seven mutations that appear to specifically 
abolish signalling by preventing association with a sienallinE 
subunit. gpl30." All except one of these lie in su^E' 
and F' and the adjoining loop (underlined in Fig 3A) im- 
plymg that this region is important in receptor subunit as^oci- 
.Tl;^^ observations, together with the recem evidence 
that IL^Ra complexes with a gpl30 dimer.*« suggest that 
Jerc could be a region in IL^Ra that includes the conserved 
^-strand F (and perhaps the WSXWS motiO that is im- 
POttant m interaction with one of the gpl30 molecules. Per- 
S." ^ this region is important in dimer formation 
The h^c receptor contiuns two of die characteristic cyto- 
kwereceptor modules' and we speculate Uiat the N-terminal 
re«1>tor stiucture masks a membrane proximal site involved 
m dimeruation. thereby preventing association. Other cloned 
^ fane receptor subunits. LIFR/?« and MPL,^ resemble 

tor *^ ""^^ » "^"^ 'yto'^nc recejv 

tor module. Tnmcation from the N-terminus may be ex^ 
1^ t activate these recepton. Although we have not yet 
tested N-terminal tnincations of h/Jc. the one gene y.mpl« 
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represents a truncation of this type. It is notable that the 
extracellular region remaining in the y-mpl product includes 
ti»e conserved segment duplicated in the h/Jc rautiit (see 
Fig 3A). consistent with some of these sequences being es- 
sential for signalling and perhaps contributing to formation 
of a dimenc complex. 

In conclusion, the activated form of h/3c suggests that a 
membrane proximal extracellular region of cytokine receptor 
subunits has an important role in signalling and we suggest 
diat this region may be critical for dimeriration (Fig 3B) 
Duplication of Uiis segment may allow hfic association in 
the absence of ligand and haice constitutively activate the 
receptor. Indeed. mEpoR is activated by forced dimerization 
through a mutiition that intniduces a cysteine residue » This 
constitutive EpoR mutation and the h/9c mutant described 
here suggest that mutations leading to inappropriate receptor 
subunit association could be a possible mode of oncogenic 
acuvation for several cytokine receptor family memben!. 
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Infe^L'Swn 3 "rJ/'^rr Signaling Subunit for 

lnterleukin.3 (ILO) Granu Colony-Stimulating Factor 

(GM-CSF), and IL-5, Lead to Ligand-Independent Activation 

By Richard J. D'Andrea. Simon C. Barry. Paul A.B. Moretti. Karen Jones, Sarah Ellis. Mathew A. Vadas 

and Gregory J. Goodall 



The hypothesis that extracellular truncation of the common 
receptor subunit for interleukin-3 (IL-3). oranulocyte-mfccro- 
phage colonystimulatlng factor, and IL-S (h/}e) can lead to 
ligand^ndependent acthration was tested by infecting fac- 
tOTKlependent hematopoietic cell lines with retroviruses en- 
coding truncated forms of h^Tc A truncation, resembling that 
to and retaining 4S McKlerived extreeellular residues, 
todto eonstftuthr. artlvatlon to the murine myeloid cell line, 
IWM. However, tofection of cells with retrovirus encoding 
a more severely truncated receptor, retaining only 7 h/Jc- 
derived extraceUular residues, did not confer factor todepen- 
*wice on these cells. These experimento show that trunca- 
tion acthntes the receptor and define a 37-amtoo add 
•egi^of hpe (H395-A431) which contains two motifs con- 

«^ Y/H XX R/Q VR and WSXWS), as es«,Vrtial for factoJ" 
Independent signaling. The mechanism of activation 

"n ECEPTORS for hematopoietic growth factors (HGFs) 
XX arc members of the expanding cytokine receptor super- 
family that is characterized by a 200-amino acid extracellular 
reaptor module (CRM) composed of two discrete folding 
domauis. each of whidi contains seven fi strands folded into 
anupar^el sandwiches, and bears a structural similarity 
to the fibronectin type ffl module and the Ig constant do- 
muns. ^ TTiis stnicture has been confirmed for two members 
Of the family, the growth hormone and prolactin receptors 
by x-ray crystaUogiaphic studies.'^ In addition to this simi- 
lanty intetiaiy stiuctMre. mi^^ 
a oiifflber of cwjscrved sequence elements: (1) four con- 
saved cysteue residues Ideated in the N-tetminal domain 
(2) a membrane proximal tip-Ser-Xaa-Trp-Ser (whei^ Xa^ 
IS any amino add) motif, also I^wn as the "WSXWS 
box." I«atcdinthcC-tenninaldeinain,'-''»and(3)apioline- 
nch motif (PRM). which may be in wived in signal transduc- 
tion, located in the membrane proximal region of the cyto- 
pbamc domain of most leceptbrs in this family." Members 
f this famdy of recq>toR do liot contain rca)gni2able tyro- 
sme kinase domains; signaling depends on association with 
cytopIasmM; tyrosine kioases of the JAK ftmily and die sub- 
Si'^ signal transducefs and activators of 
trtoscriptfibl*'*"' I" /V'-ni! : • - : "^r^ 

■CMstitutive 'mnti^ in cytokine receptors have pro- 

^gmutatum h^ ^ structure of the 

normal rec^ itf tfie ligand-actiVated state and therefore 
could provide impoitam clues to the activation process 

SlL^Sj;^ "'* "^^^ i" «" pmlifi:^^ 
survival these acuvahni mutations may be predicted to hav 
racogemc potentiaL Indeed. the murine y^l oncogene en- 
^^^S^'^^ cytokine receptor 
£ 52^^15' ^ :»«i«*.nyel6prollf«tive leukemia 
iTL^A^ ^^'^ ^^^"^ acid fusion proidn^ 
Wrfci, the-flW iqoadir^^i^^ derived bom^£. 



was also invesfgated m less sever, truncations. A receptor 
that retains the entire membranei»roxlmal domato (domain 
4) also conferred factor independent growth on FDC-Pl cells- 
however, a retrovirus encoding a truncated form df hflc hav- 
!v ^aii"*"^ '"^T'brane proximal domatos did not have 
f "Z^- »"90esting that domain 3 may have an tohibitory 

[„?.i T ^ '""^r ^^-^ "^-P*"" »«» confer 

demonstrated by their to- 
duSilr '•««:;"'*P«'»«tent growth when intro- 
duced Into the murine IL-S-dependent pro-B cell Kne BaF- 

ISv^^J."- ~?*^*«» " model to which 

activation requires unmasking of an Interactive receptor sur- 
face in domato 4 and association with a myeloid-speeifie 
[^n!l!nrT"^ component We suggest tt«tlntt.e 

;?oprirsiS^^^^^ "'-^^ ""^ 

© 1996 by The American Society of Hematology. 

gion and the other 184 amino acids are from c-mpl " which 
encodes the reccp.or for thrombopoietin nrPO).'^ " The ex- 
tracellular domain is mosUy composed of the F-MuLV env 
sequences, but with 43 amino acids derived from c-Mpl (Re 
lA). Thus in v-Mpl most of the TPO-R extracellular domain 
has been deleted, generating a truncated receptor. Although 
the mechanism of activation of the y-mpl product has not 
been determined it has been shown that Mpl can be activated 
by cysteme substitutions in the predicted dimer interface 
doinain. suggesting one normal mechanism of activation 
probably myolves ligand-induced hoinodimerization." 

Forced receptor homodimerization is involved in aberrant 
activation of other cytokine receptors. A constitoiti vdy active 
form of die erydiropoietin receptor (EPO-R) containing an 
aiguune-to-cystcine substitution at position 129 has been 
dcscnbcd. niis R129C form of EPO-R forms disulfide- 
Imted homodimas in die absence of EPO, suggesting diat 
wdd-^ EPO-R is also adlivaied by ligarid-induced homo- 
dimerization. tofection of mice widi recombinant retrovirus 
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Hg 1. CytoUM raceptor m. 
traceOular truncations. (A) Sche- 
matic mustratlon stiowtng th« 
maior Moan of Mpl (hMfd-P) 
and tha v-MpI fusion product 
Mpl has two CRMs, tha distal 
ona containing an Insartion of SO 
rasiduas. tha v-MpI product is 
darivad from tha MPLV ratrovf- 
rus and is a hision protain con- 
taining raanangad mt¥ aa- 
quancas hisad to truncatad MpL 
Tha conserved cystaina rasiduas 
and WSXWS motif charactarfsttc 
of tha cytotcina recaptor famtty 
are indicated fay horfzontel Ones. 
(B) Schematic illustration show- 
ing the axtracaQular truncation 
muUnts of i^ that have been 
ganarated. Sita-dlracted muta- 
genesis was used to construct 
in-frame deletions of the FLAG- 
1^ cONA. Residues delated are 
as folloifvs: hficAU, E25-E232, 
hficAQP, E2&W37; hficAK 
E25-A394; I^WS. E2S^1. 
the signal sequence and posi- 
tion of tha introduced FLAG oc- 
Upeptide are Indicated. Con- 
satved cysteine residues and the 
WSXWS sequence that are char- 
acteristic of the cytoidne re- 
ceptor fsmHy are shown by 
horizontal lines. Extracellular 
domains are numbered from 1 
to 4. 



expressing this receptor induces erythroleukemia.*' The in- 
troduction of other cysteine residues into the EPO-R mem- 
brane proximal domain also leads to disulfide-linked homo- 
dimcrs that are constitutively active." A mutation in the 
transmembrane domain of the human common /? subunit 
(h^c) for intcrleukin-3 (11^3), granulocyte-macrophage col- 
ony-stimulating factor (GM-CSF), and IL-5, which presum- 
ably works through a similar mechanism, has recently been 
described.'' in this mutant (h^cV449E), glutamic acid is 
substimted for valine 449 in the transmembrane domain, 
conferring factor-independent growth on FDC-Pl and BaF- 
B03 cells. This mutation is analogous to an activating muta- 
tion in the c-neu proto-oncogene (HER-2, erb-B2 receptor), 
which leads to receptor oligomerization via a transmem- 
brane-mediatcd association.^ An activated form of the mu- 
rine fic subunit has recently been isolated from a spontane- 
ous, factor-independent subline of the promyelocytic cell 
line, D35. In this receptor the extracellular domain has been 
replaced by 34 amino acids encoded by intron 10 sequences 
and it is posmlated that the replacement of the normal extra- 
cellular sequences with the intron encoded segment facili- 
tates honruxlimerizati n, perhaps through an extracellular 
cysteine bridge.^ 

Wc have recently described two Ugand-indepcndcnt muta- 
tions in the membrane proximal, extracellular domain f Iv5c 
that suggest an alternative mechanism of aberrant cyt kine 
receptor activaUon. One f these (h^FIA) is a duplication 



of a short (37-amino acid) receptor segment including the 
WSXWS motif" and the second 0v5cD74N) is a point muta- 
tion leading to a single amino acid substitution (isoleucine 
374 to asparagine).** Both mutant receptors confer growth 
factor independence and tumorigenicity on FDC-Pl cells. 
The mechanism of activation for these mutations is less clear; 
however, they do not confer ligand-independent'proUferation 
in BaF-B03 cells, suggesting a different mechanism to 
h^cV449E.'* We have proposed that these mutations may 
lead to an alteration in receptor structure that results in the 
unmasking of an interactive surface that is not normally 
available in the receptor monomer.^'-" We speculated further 
that ligand association leads to activation by exposing such 
a surface and allowing an interaction that can lead to signal- 
ing. One prediction from such a model is that extracellular 
truncation may lead to activation by unmasking an inter- 
active region and it is notable that the severely truncated v- 
Mpl product retains 43 amino acids of extracellular se- 
quence," including the two conserved motifs duplicated in 
h^cHA (Fig lA). 

In this context we have now tested the hypothesis that 
extracellular tnincati n could lead to constitutive activation 
of cytokine recept rs. We have examined a series of h^c 
extracellular recept r truncations for their capacity t confer 
growth in factor-dependent cell lines in the absence of 
growth factor. We find that infection of the murine myeloid 
cell line, FtXT-Pl, with retroviruses encoding truncated 
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fonns of h^c leads to factor-independent proliferation pro- 
vided domains 1-3 are removed and a conserved membrane 
proximal segment is retained. 

MATERIALS AND METHODS 

Construction and expression of hpc receptor mutations. The 
cDNA fonns used in this study were constructed by in vitro 
mutagenesis. Wc utilized a mutagenic oligonucleotide (5'CCCTG- 
TGCTCK3GTGCTGAGCGGCGCACAGGCAGACTACAAGGAC- 
GACGACGACAAGGAAGAAACCATCCCG3') to incotpoiatc the 
coding sequence for the FLAG octapeptide (Easunan- Kodak Co. 
New Haven. CT). DYKDDDDK, after the signal sequence (between 
residues A24 and E25) in the h^ cDNA. To optimize cleavage of 
the signal pepdde, residues 18-24 were altered from ERSLAGA 
to VLSGAQA. These modifications were confiimed by sequencing 
and this FLAG-h^c cDNA cloned into pALTER (Promega, 
Madison. WI) for further mutagenesis. Mutagenesis was per- 
formedwith mutagenic oligonucleotides designed to remove 
the desired extracellular sequence while leaving coding sequences 
for the N-terminal signal sequence and the inserted FLAG 
octapeptide intact Mutagenic oligos were as follows: h^cAN; 
S'GACGACGACGACAAGGTTIXJITGGGACTCCS'. hj3cAH; 5' 
GACGACGACXjACAAGCACAGCATGGCCCTGS', h^cAQP; 
5'ACGACGACGACAAGATCCAGATGGCCCCT3', h/3cAWS; 
5'ACGACGACGACAAGCGCTCCTGGGACACC3'. All mutants 
were obtained after screening DNA minipreparations from the ampi- 
cillin resistant colonies for an h^c fragment of diagnostic size. Muta- 
tions were confirmed by DNA sequencing, which was performed 
using T7 Polymerase (Sequenase; United States Biochemical Corp 
[USB. Qeveland, OHJ) as per the manufacturer's protocols. The 
receptor mutants generated in this study are described in detail in 
the text and are shown schemaucally in Rg IB, For expression, 
mutant receptors were excised from pALTER and cloned into the 
retroviral expression vector pRUFNeo.** Retroviral DNA was used 
to transfect the ecotropic packaging cell line. ^2." and virus from 
G418-resistant cells was used to infect the murine hematopoietic 
cell lines, FDC-Pl" or BaF-B03'' by cocultivation.*» 

Cell lines, growth factors, and flow cytometry. Murine myeloid. 
IL-3/GM-CSF-depeiident FDC-Pl cells, and derived cell lines were 
maintained in Dulbecco*s modified Eagle's medium (DMEM) sup- 
plemented with 7,5% fetal bovine scrum and murine GM-CSF (80 
U/mL; gift from Dr G. Begley. Walter and Eliza Hall Instimte for 
Medical Research. Melbourne Australia). Murine IL-3 -dependent 
BaF-B03 cells were maintained in DMEM supplemented with 7.5% 
fetal bovine serum and murine IL-3 (300 U/mL; gift from Dr A. 
Hapel. John Curtin School for Medical Research. Canberra, Austra- 
lia). After cocultivation with producer cells transfected with pRUF- 
Neo h^ constructs. FDC-Pl. or BaF-B03 cells were harvested and 
selected in die presence of G418 (1 mg/mL; Sigma- Aldrich Pty Ltd, 
Castle Hill NSW Australia) and growth faaor. Receptor expression 
was examined by flow cytometric analysis after staining with the 
FLAG-specific monoclonal antibody (MoAb). M2 (Eastman-Kodak 
Co). Briefly, cells were washed and resuspended in cold phosphate- 
buffered saline supplemented with 5% bovine serum albumin 
(PBSA). Cells were incubated with the M2 MoAb (1:300) tor 20 
minutes on ice, washed, and sut>sequently incubated wiih biodnyl- 
ated andmouse IgG (1:50; Vector Lab Inc. Burtingame. C:A) for 20 
minutes on ice. After washing and resuspension in cold PBSA the 
cells were incubated with strepuvidin conjugated phycoerythiin 
(1:50; Caltag Laboratories. San Francisco. CA) for a further 20 
minutes, washed, resuspended in PBSA -f 0.01% sodium azide, and 
analyzed using an Epics-Profile D analyzer (Couher. Hialeah, FL). 

MUogenic assays. To assay prolifcntioa, infected cells (FDC- 
Pl, BaF-B03) were washed three times with PBS to remove grxywth 
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wet) microliter plate in the presence and absence of the appropriate 
growth factor. Cell growth was determined after 72 hours using a 
Cell titer 96 nonradioactive cell proliferation assay (Promega. Madi- 
son. WI). Quantitation was performed using an automated plate 
reader (BioRad Laboratories, Ply Ltd. North Ryde. NSW Australia). 
To establish relative growth rates of cells expressing factor-indepen- 
dent mutants, cells were first selected for factor independence, seeded 
at 5.000 cells per well, and proliferation assayed, as described earlier, 
on each of the following 4 days. 

Preparation of cellular DNA and polymerase chain reaction 
(PCR) analysis. Genomic DNA was isolated from cells using a 
Proteinase K/SDS procedure.^' PCR was performed on 300 ng of 
genomic DNA using standard protocols." Internal h^ primers were 
used for amplification (STXiAATTCGCCTGTCCAGAGCTGAC- 
CAGGG 3' and S'TGGATCCTCTGTGGGTAGATCTGAGCKrAG 
3'). Reactions were performed in a Perkin Elmer Thermocycler and 
the cycling parameters were: 30 cycles— 94'*C for I minute. 60^ 
for 1 minute. 72^^ for 1 minute. Reactions were denatured at 94*^ 
for 2 minutes before cycling. PCR products were fractionated using 
1% agarose mini-gels and visualized by staining with ethidium bro- 
mide. 

RESULTS 

Construction and expression of mutant hfic cDNA, In 
vitro mutagenesis was used to construct the series of extra- 
cellular h)5c deletion mutants indicated in Fig IB, In each 
of these mutations a large portion of extracellular region was 
replaced by the FLAG octapeptide. The N-terminal FLAG 
octapeptide does not affect receptor binding or function in 
FDC-Pl cells (data not shown) and allows detection of al- 
tered receptors expressed on the cell surface using the 
FLAG-specific MoAb, M2. The mutant h^cAN completely 
removes domains 1 and 2 (CRMl, residues E25-E232 inclu- 
sive) whereas the mutant h;5cAQP removes domains 1-3 
(residues E25-N337 inclusive; the N-terminal CRM and the 
first domain of CRM2). h^AH is truncated to a site within 
domain 4 (residues E25-A394 deleted) and retains 45 hfic- 
derived, membrane proximal residues, including those that 
are duplicated in the constitutive h^c mutant, h^cFIA,^ and 
the equivalent region to that retained in v-Mpi. The mutation 
in h^cAWS removes residues E25-A431, including the 
membrane proximal WSXWS,, and leaves 7 residues be- 
tween the FLAG octapeptide and the transmembrane do- 
main. These modified hfic cDNAs were cloned into the ret- 
roviral expression vector pRUFNeo.^ 

Retroviral DNA was introduced into the retroviral packag- 
ing cell line, ^2, and stable pools of transfected ^1 cells 
were produced by antibiotic selection. To test the ability f 
each mutant to induce factor-independent growth, w in- 
fected the murine myeloid cell line, FDC-Pl, and the IL- 
3-dependent pro-B cell line, BaF-B03. We demonstrated 
surface expression f mutant recept rs n infected FDC-Pl 
cells by flow cytometry f 11 wing staining with th M Ab, 
M2 (Fig 2). All truncated recq>tors were detectable by this 
procedure, indicating that receptor tnincation did n t prohibit 
surface expression. 

Proliferation of infected FDC-Pl and BaF-BOS cells. 
Uninfected FDC-Pl cells and G4l8-rcsistant FDC-Pl 
cells infected with the h)9cAN r h)9cAWS rctrovinises 
grew in th presence of murine (m) GM-CSF but failed 
to proliferate after removal of growth factor. In contrast, 
POC-Pl cells infected with eith r the h^cAH r h)9cAQP 
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Fluorescence 

Rg 2. Expression of hfic mutants in FDC-PI cells. Cells were 

stoined wfth an irrelevant control antibody ( 1 or the onti-FLAG 

monoclonal antibody, iVI2 ( 1. Cell number and fluorescence are 

in arbitrary units with the laner plotted on a logarithmic scale. 



deletion mutants grew both in the presence and absence 
of growth factor (Fig 3A). The different degree of prolifer- 
ation observed in the absence of factor by cells infected 
with the h^cAQP and hyJcAH retroviruses probably re- 
flects the proportion of each G4 18 -resistant population 
that expresses h^c. To eliminate the possibility that these 
mutants confer different growth rates on FDC-Pl cells, 
we compared proliferation over a 4-day period after selec- 
tion for growth factor independence. As shown in Fig 3C, 
h^cAQP- and h^cAH-infected ceils did not grow at a rate 
significantly different from each other, or from FDC-Pl 
cells grown in the presence of mGM-CSF. To show that 
the hpc cDNA had not undergone a rearrangement in the 
factor-independent FDC-Pl cells derived from infection 
with h^cAQP and h^cAH retroviruses, we recovered part 
of the h^c cDNA spanning the deletion, by PCR from 
cellular DNA with h^c specific primers. DNA from G4I8- 
resistant FDC-Pl cells infected with h/3c retro- 
virus, and from factor-independent cells infected with 
pRUFh)9cAQP and pRUFh^cAH retroviruses generated 
the expected products of 1.726 bp. 814 bp, and 640 bp, 
respectively (Fig 4), consistent with h^c retroviral in- 
sert(s) in these cells containing the appropriately mutated 
h0c cDNA. In common with the h^cRA and h^cI374N 
mutants,^' none of the truncated receptor retroviruses was 
capable of conferring IL-3- independent growth when 
used to infect the murine pro-B cell line BaF.B03 
(Fig 5). 



DISCUSSION 

The possibility that cytokine receptors can be activated 
by truncation of their extracellular domain is suggested by 
the oncogenic v-Mpl product (see Fig lA). An thcr candi- 
dale for activation in this way is the common 0 chain (h/?c) 
for IL-3. GM-CSF, and IL-5, which like Mpi, has tw repeats 
of the cytokine receptor motif." The GM-CSF» IL-3, and IL- 
5 high-affinity receptors are composed of a ligand-specific a- 
chain (GMRa. IL3Ra, and ILSRa), with which Ihey f rm 
a low-affinity complex, and h^c."'^ Although h^c alone 
does not detectably bind cytokine it is responsible for affinity 
conversion and signal transduction.'* '" Residues in the mem- 
brane proximal domain (domain 4) of h^c have been identi- 
fied as critical for affinity conversion by h0c, presumably 
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Rg 3. Prolifeffition of FDC-Pl celts expressing mutant forms of 
h/Jc. (A) Infected FDC-Pl cells grown In the preoence (ligfit greyl or 
obsence (dortt greyl of mGM-CSF. ProOforotion ossoyo wero per- 
formed os described in Moteriols ond Metttods. FDC*P1 colls Inf octod 
with pRUFNeo or wfth on hficBA retrovirus^ ore sttown os controls. 
(8) A schemotic representation of tho h^c mutonts indudod In ttto 
obove ossoy- Conserved cystoino residues and WSXWS motifs oro 
shown. h^cRA contoins o dupRcoted oogment of 37 resitfuos In do- 
moin 4.*" lO Growth curve comportng l^^cAQP qtkI t^^cAH prolif oro- 
tion ovor 4 doys. Cells were solocted bof oro ossoy by growth In f octor- 
free modium. Proliferation of unbifected FDC-Pl coUs in ttie prosonco 
of mGM-CSF is shown for comporison. 
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Rg 4. PCR analysis of retroviral ONA In factor-independent cells. 
PCR from genomic DNA to detect h^c sequences in factor-indepen- 
dent FDC-P1 celU derived from infection wfth retrovirus encoding 
h^cAQP and hficAH (see text). PCR products were separated on a 
1% agarose gel alongside molecular-wetght markers (1-kb ladder; 
BRL Gaittiersburg, MD). The expected product sizes are 1,726 bp, 
814 bp, and 640 bp for h/5c, h/3cAQP, and h^cAH, respectively. Sizes 
shown on tlM left refer to molecular weight markers. 



through an inleraction with residues in the N-terminal helix 
of Ihe^wth factor.^ However, to date there is little infor- 
mation regarding the role of the first three domains of hfic 
in ligand binding. Several activating mutations cluster in 
domain 4 of h^c consistent with a critical role for this do- 
main in activation and signaling.^*-^ Despite considerable 
knowledge regarding the intracellular pathways involved in 
signalling via h^c, there is still an incomplete understanding 
of the stoichiometry of the active receptor complex and the 
precise role of the a and 0 subunits in signaling. 

To test the hypothesis that truncation can lead to ligand- 
independent cytokine receptor activation, we constructed a 
series of cDNAs encoding extracellular deletion mutants of 
h^c. Our results show that infection of the murine IL-3/ 
GM-CSF-dcpendent cell line, FDC-Pl, with retroviruses 
encoding two of these truncated forms of h^c (h^c AQP and 
h^cAH) leads to factor-independent growth. The inability 
of the other truncated forms of h0c Qi0cAN and h^c A WS) 
to confer factor independence in FDC-Pl cells was not due 
to a defect in transport to the cell surface as we could detect 
surface expression using an MoAb directed to the introduced 
N-terminal FLAG epitope. 

The observati n that hficAH confers factor independence 
while h^AWS docs not. defines an extracellular region 
of h^ (residues H395-A431) as essential for constitutive 
activaU n in FDC-Pl cells. It is notable that in v-Mpl. the 
c rresponding recqHor segment is retained. The mechanism 

f activation of the v-m^/ product remains unclear, h wever, 
based n our observations we suggest the loss of the Mpl 
N-fftrminal reffion is important and may expose a site of the 



or a heterodimer. The extracellular region retained in v-MpI 
contains the WSXWS motif and the conserved 0 strand F' 
(consensus Y/H XX R/Q VR). These motifs are also within 
the duplicated segment in the mutant h)3cFIA, which pre- 
viously led us to speculate that they are involved in receptor 
activation." The conservation of this membrane proximal 
segment throughout the cytokine receptor family suggests 
that it may play an important role in the activation mecha- 
nism of class 1 receptors, perhaps through interactions with 
conserved receptor subunits. Interestingly, a previous study 
with chimeric receptors indicated that the extracellular do- 
main of cytokine receptors determined, to some extent, the 
specific phosphorylation events that were triggered in re- 
sponse to ligand. suggesting interactions between extracellu- 
lar sequences and signal transducers.^ 

In the growth hormone/growth hormone receptor and 
growth hormone/prolactin receptor complexes the corre- 
sponding segment has been shown by crystallographic stud- 
ies to form part of an exposed, positively charged ^-sheet in 
the membrane proximal domain of the ligand-bound receptor 
complex.^-^ By homology the conserved residues in this seg- 
ment of h^c would be positioned distal to ligand and a- 
subunit in a ligand-bound complex. It is possible that dierc 
are two interactive surfaces in cytokine receptor subunits 
that are involved in formation of oligomeric complexes. Al- 
though h^c has not been shown to form a higher order 
complex, we have speculated previously^' that activation 
might occur through formation of ai02 or a0y type com- 
plexes. Thus, h^c may have at least two interactive surfaces: 
one for association with a specific a-chain and a second 
that interacts with either another h>5c or a third receptor 
component. Although there are several conserved residues 
between H395 and A431. it is not possible, from this work. 
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to define which residues are essential in receptor activation. 
In v-Mpl. mutation of the WSXWS motif does not impair 
pathogenicity of MPLV/ suggesting that other residues in 
this segment may be important 

The observation that expression of the mutant h^cAN 
does not confer factor independence in FDC-Pl cells implies 
that the third fibronectin-like receptor domain (domain 3) 
may have a role in inhibiting signaling by unoccupied recep- 
tor, presumably through an interaction with residues in do- 
main 4. Removal of this domain (as in h)3cAQP) is sufficient 
lo lead to constitutive activation. It is tempting to speculate 
that the B'C and PC loops of domain 4, which are im- 
portant in ligand binding^ (and Woodcock ct al, submitted 
for publication), interact with ligand in such a way that the 
orientation of domains 3 and 4 is altered, exposing the inter- 
active surface that allows higher order complex formation. 
In this way ligand may displace inhibitory interactions in 
the same way as truncation. 

Although the mutants h/?c AQP and h^c AH conferred fac- 
tor-independence on FDC-Pl cells, they did not confer fac- 
tor-independence on the IL-3-dependent pro-B cell line. 
BaF-B03 (Fig 5). Two other extracellular mutations! 
h/3cnA" and h^cI374N.^' similarly confer factor-indepen- 
dent growth in FDC-Pl cells but not in Ba/F3 cells. On the 
other hand the h^c mutant (h^cV449E). which resembles the 
neu oncogenic receptor, and presumably leads to ^-subunit 
dimcrisaUon, confers IL-3- independent growth on infected 
BaF-B03 ceils.'' Consistent with this, chimeric receptors 
comprised of the GMRa or IL5Ra extracellular domain and 
the h^c cytoplasmic domain give a ligand-dependent re- 
sponse in Ba/F3 cells expressing h^c.»*' « It appears from 
these results that h^c can be acUvated by homodimcrizaUon; 
however, the results presented here, and in the report by 
Jenkins et al.*» suggest that ligand-independcnt f rms of h^c. 
with extracellular muutions. utilize an alternative mecha- 
nism of activation that may require interaction with an un- 
known subunit present in FDC-Pl cells but absent from BaF- 
B03 cells. Taken together with the results from this study. 
It is reasonable to suggest that the interacUon of h/?c with 



such a factor could be mediated by residues within the seg- 
ment H395-A431. 

In Fig 6 we present a model for h^c activation. We 
propose that extracellular mutations of h^c (h^cFIA, 
h^cI374N. h)5cAQP. and h^cAH) that lead to ligand-inde- 
pendent activation may allow constitutive association with 
an unknown, myeloid-resiricted, receptor component (y) and 
may be mimicking a normal a0y or 0y complex. The fact 
that there are several extracellular activating mutations clus- 
tered in domain 4 is consistent with activation involving the 
disruption of an inhibitory conformation of the unoccupied 
form of h0c. In this conformation we would predict that a 
critical structural motif within domain 4 (involving the seg- 
ment H395-A431) is masked, possibly through interactions 
involving domain 3. We propose that the I374N substitution 
and the h)5cFIA duplication can interfere with the structure 
of domain 4 in such a way that they lead to the disruption 
of inhibitory interactions.^' By analogy with these mutants 
we suggest that association with ligand alters the monomeric 
h^c conformation and leads to association with other recep- 
tor components and the subsequent generation of intracellu- 
lar signals. 
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In tt« preceding p^[CIioo,Y.&Khig, a. 
(IW^Prtc- Natl. Acad. Sd. USA 91, 11163-11167], we 
nwwed how sefectkms from a library of sine flogm db|4ayed 
OB phage yielded fingrn abfe to bind to a Bomber of DNA 
tr^iets. Here, we describe a technique to deal eflideBtly with 
the canvene^oUeni-Hiamefy, the seledloa of a DNA bin^ 
stte for a givoi ifaic finger. This is done by scmnfaig •g^fawf 
Bbraries of DNA triplet Undhig sites randomlied hi two 
positkms bot having one base fixed in the third position. Hie 

tochnlqiie Is applied here to detemdne the specificity of fii«en 
previoosty selected by phJtge disiday. We fin^ 

fingers are able to spediy a unique base hi each position of the 
cognate triplet. This b ftuther fflnstrated by examples of 
fingers which can discrimfaiate between dooely related triplets 
as measured by their respective eqnfl&rtmn Hfaa^ wH afiftn con* 
stants. Oanpartng the amfaio add sequences of fingen which 
« partkuhu* base hi a triplet, we fail^ that fai most 
^traces, sequace-qpedfic bfaidhig of ifaic fingers to 

Deadilevedby using a small set of amino add-nudeo tide base 
contacts amenable to a code. 

In prindple, rules governing protein-DNA interactions can 
be deduced from a large database of correlations between the 
amino acid sequences of the proteins and the nudeotidc 
sequences of their optimal binding sites . To this end, we have 
shown in the preceding paper (1) that functionally equivalent 
zinc fingers which bind to a given DNA sequence can be 
selected from a phage display library. However, determina- 
tion of the optimal binding site for these fingers is stiU 
required, as a safeguard against spurious sdections. One can 
determine the optimal binding sites of these (and other) 
proteins, by selection from hbraries of randomized DNA. 
This approach, the principle of which is essentially the 
converse of zinc finger phage display, would provide an 
equaUy informative database from which the same rules can 
be independently deduced. However, until now the fovored 
method for binding-site determination, involving iterative 
sdection and amplification of taigct DNA followed by se- 
quencing, has been a laborious process not conveniently 
applicable to the analysis of a large database (2, 3). 

We present here a convenient and rapid method which can 
reveal the optimal binding site(s) of a DNA-binding protein 
by single-step selection from small libraries, and use this to 
check the binding-site preferences of those zinc fingers 
sdected previously by phage display (1). For this application, 
we use 12 different minilibraries of the binding site for 
transcriptioo factor Zif268, each one with the central trijrfet 
having one position defined with a particular base pair and the 
other two positions randomized. Each library therefore com- 
prises 16 oligonudeotides and offers a number of potential 
bmding sites to the middle finger, provided that the latter can 
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tolerate the defined base pair. Each zinc finger phage is 
screened against aU 12 Ubraries individually inmuriiilized in 
wdls of a microUter plate, and binding is detected by an 
enayme immunoassay. Hius, a pattern of acc^table bases at 
each position is disclosed, which we caU a binding-site 
signature. The information contained in a binding-site signa- 
ture enconv>asses the repertoire <rf binding sites recomized 
by a zinc finger. 

The bindiiig-site signatures obtained by using zinc finger 
ph^ge sdected as de soibed in the preceding paper (1) reveal 
that the selection has yielded some highly sequence-specific 
zinc fingers which discriminate at all three positions of a 
triplet. From measurements of equih*brium dissociation con- 
stants, we find that these fingers bind tightly to the triplets 
indicated in their signatures and discriminate against dosdy 
related sites usually by at least afactor of 10. The binding-site 
signatures aUow us to infer rules for a specificity code for the 
mteractions of zinc fingers with DNA. 

MATERIALS AND METHODS 

BfauUng^ite S^natnres. Flexible flat-bottomed 96-weU 
plates (Falcon) were coated overnight at 4*C with stieptavi- 
^ (0.1 mg/ml in 0.1 M NaHCQ,, pH 8.6/0.03% NaNj). 
WcUs were blocked by incubation for 1 hr with PBS/Zn 
(pho^>hate-buffcred saline plus 50 /iM zinc acetate) contain- 
ing 2% (wt/voO fet-free dried milk (Marvd) and were washed 
three times with PBS/Zn containing 0.1% Twecn and three 
tunes with PBS/Zn. The "bound'' strand of each oligonu- 
deoude Ubrary was made syntheticaUy and the other strand 
was extended from a 5'-biotinylated universal primer by 
DNA polymerase I (Klenow fragment). Products of fill-in 
reactions were added to wells (0.8 pmol of DNA hbrary in 
OM:h) m PBS/Zn for 15 min and then washed once with 
PBS/Zn containing 0.1% Twecn and once with PBS/Zn. 
Overnight bacterial cultures each containing a sdected zinc 
finger phage (1) were grown at 30^ in 2xTY medium 

^"/^f*^/? ^ 1^ '*« tetracycline per 

ml (2XTY/Zn/Tet). Culture supematanto containing phage 
were dduted 10-fold by addition of PBS/Zn containing 2% 
(wt/vol) fat^ dried milk, 1% (vol/voQ Twecn 20and2b iig 
of somcated salmon sperm DNA per ml. Dihited phage 
s^utions (50 Ad) were appUed to wells and binding was 
allowed to proceed for 1 hr at 20°C. Unbound phage were 
removed by washing five times with PBS/Zn containing 1% 
Tween and then three times with PBS/Zn. Bound phage were 
detected as described (4) or by using horseradish peroxidase- 
conjugated anU-M13 IgO (Riarmada) and quantitated with 
soFTMAX 2.32 (Molecular Devices). 
Detcnnfaiatlon of Apparent Equilibrium Dissociation Con- 
XlS^^' <>vc™«ht bacterial cultures were grown in 
2xTY/Zn/Tet at 30'C. Culture supcmatants containing 
phage were diluted 2-f Id by the addition of PBS/Zn con- 
taining 4% fat-frce dried milk, 2% Tweets 20. and 40 « of 
somcated saim n sperm DNA per ml. Binding reacti n 
nuxtures containing appropriate concentrations f specific 
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5'-bi tinylated DNA and equal volumes of zinc-finger phage 
solution were allowed to equilibrate for 1 hr at 2(rc. All DNA 
was captured n streptavidin-coated paramagnetic beads 
(500 Mg per well), which were subsequently washed six times 
with PBS/Zn containing 1% Tween and then three times with 
PBS/Zn. Bound phage were detected with h rscradish per- 
xidase-conjugated anti-M13 IgG (Pharmacia) and developed 
as described (4). Optical densities were quantitated with 
SOFTMAX 2.32 (Molecular Devices). 

Ki values were estimated by fitting to the equation K4 = 
[DNAI[protcin]/[DNA-i)rotcinJ with the program kaleida- 
ORAPH version 2.0 (Synergy Software, Riding, PA). Owing 
to the sensitivity of the ELISA used to detect protein-4>NA 
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complexes . we can use zinc finger phage concentrations far 
below those of th DNA, as is required for accurate calcu- 
lanons of K4. The technique we use has the advantage that 
while the concentrati n of DNA (variable) must bcto wn 
accurately, that of the zinc fingers (constant) need not be 
kn wn (5). This circumvents the problem of calculating the 
number f zinc finger peptides expressed on the tq) of each 
phage, although since only 1(^20% of the gene ffl protein 
(pill) carries such peptides, we would expect on average less 
than one copy per phage. Binding is performed in sohition to 
^vent any effects caused by the avidity (0 of phage for 
DNA mwiobilized on a surface. Moreover, in this case 
measurements of by ELISA are made possible because 
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equimmum breached in solution prior to caimire nthcsoUd 
phase. 

RESULTS AND DISCUSSION 
Binding-Site Signatore rfthe Sccood Zinc Filler <rfZi£268. 
Tlic top row of Fig. 1 shows the signature of the second finger 
of wild-type Zif268. Ftom the pattern of stnmg signals 
mchcatmg binding to oligonucleotide hbrarics haviog GNN 
TON, NGN and NNG as the middle triplet, U emeiges that 
the optmial bmding site for this finger is (T/0)OG, in accon! 
with the published consensus sequence (7). This has impli- 
cations fOT the intcipretation of the x-ray crystal structure of 
solved in complex with a consensus opeiHtor having 
TOG as the middle triplet (8). Far instance. His at position +3 
^ the middle finger was modeled as donating a hydrogen 
bond to N7 of guanine, suggesting an equivalent contact to be 
possible witii N7 of adenine, but torn the binding-site sig- 
nature we can see Uiat tiiere is discrimination against ade- 
nine. This implies that Uie His may prefer to make a hydrogen 
bond to 06 guanine or a bifurcated hydrogen bond to both 
06 and N7 or that a steric clash with the amino group of 
adenine may prevent a tight interaction witii tiiis base. Thus, 
Bom consideration of the stereochemistry of double-helical 
DNA, binding-site signatures can give insight into the details 
of zinc finger-DNA interactions. 

Amino Add-Nttdeotkle Base Contacts in Zinc FInger-DNA 
Complexes Deduced from BfauUng-Slte Signatures. The bind- 
mg-site signatures of otiier zinc fingers (Fig. 1) reveal tiiat tiie 
phage selections we performed in our previous study (1) have 
yielded highly sequence-specific DNA-binding proteins. 
Some of tiicsc are able to specify a unique sequence for the 
middle triplet of a variant Zi£268 binding site and are therefore 
more specific tiian is Zif268 itself for its consensus site. 
Moreover, one can identify tiie fingers which recognize a 
particular oligonudeotide library—^t is to say a specific 
^sc at a defined position--by looking down tiie columns of 
Fig. 1. By comparing the amino acid sequences of these 
fingers we can identify any residues which have genuine 
preferences for particular bases on bound DNA. With a few 
exceptions, tiiese are as previously predicted on the basis of 
phage display (1) and arc summarized in Fig. 2. 

The binding-site signatures also reveal an important feature 
of our phage display library which is crucial to tiie interpre- 
tation of our selection results. AU tiie fingers in our panel, 
regardless of tiie amino acid present at position +6, are able 
to recognize guanine or botii guanine and tiiymine at the 5' 
end of a triplet. Our explanation for tiiis is tiiat tiie 5' position 
of tiie middle triplet is fixed as eitiier guanine or tiiymine by 
a contact torn the invariant Asp at position +2 of finger 3 to 
the partner of eitiicr base on tiic complementary strand, 
analogous to tiiosc seen in tiie Zif268 (8) and tramtrack (9) 
crystal structures (a contact to tiie NH2 of cytosine or 
adenine, respectively, in tiic ni^or groove). Therefore Asp at 
position +2 of finger 3 is dominant over tiie amino acid 
present at position +6 of tiie middle finger, precluding tiie 
possibility of recognition of adenine or cytosine at the 5' 
position. Future libraries must be designed witii tiiis inter- 
action omitted or the position varied. Interestingly, given tiie 
namework of tiie conserved regions of tiie three fingers, we 
can identify a nile in tiie second finger which specifies a 
frequent interaction with botii guanine and tiiymine— 
namely, tiie occurrence of Ser or Thr at position +6, which 
may donate a hydrogen bond to either base. 

Modolation of Base Recognition by Auxiliary Pteltions. As 
we have n ted above, positi n +2 is able to specify tiie base 
directiy 3' of tiic "cognate triplet" and can tiius work in 
coiyunction with position +6 of tiie preceding finger. The 
binding-site signatures, while pointing to amino acid-base 
contacts trom the tiiree primary positions, indicate tiiat 
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Fio.2. Summary of fiequenUy observed amino add-nucJeoUdc 
Mse contacts m interactions of selected zinc fingers wiUi DNA The 
given ccmtacts comprise a "syllabic" recognition code (see text) tor 
ap^pnate triplets. Cognate amino acids and their positions in the 
a^te^ are entered in a matrix relating each base to each position of 
a tnplct. Auxiliary amino acids from position +2 can enhance or 
inodulate specificity of amino adds at petition -I, and titese are 
hsted as Ser or Thr at position +6 permit Asp at +2 of die 
foUowing finger (denoted Asp++2) to specify both guanine (O) and 
thynu^ m indirectly, and tiic paii3 are listed. TTic specificity of Ser 
at +3 for T and Thr at +3 for cytosine (Q may be interchangeable 
^ ^'''^ ^ ^ consistenUy 

auxiliary positions can play otiier parts in base recognition. 
A clear case m point is Gin at position -1, which is specific 
for adenine at tiie 3' end of a triplet when position +2 is a 
small nonpolar amino acid such as Ala but is specific for 
tiiymine when a polar residue such as Ser is at position +2 
The strong correlation between Aig at position -1 and Asp 
at position +2, tiie basis of which is understood ftbm tiie 
x-ray crystal structures of zinc fingers (8, 9), is anotiier 
mstance of interplay between tiiese two positions. Thus tiie 
ammo acid at position +2 is able to modulate or enhance tfie 
specinaty of the amino acid at other positions. 

At IKwition +3, a different type of modulation is seen in tiie 
case of Thr and Val, which most often prefer cytosine in tiie 
middle position of a triplet, but in some zinc fingers are able 
to recognize botii cytosine and tiiymine. This ambiguity 
occurs possibly as a result of different hydrt)phobic intcrac- 
^2"® *® «^"P® of tiiese residues, and here 

a flexibihty in tiie incUnation of tiie finger ratiicr tiian an effect 
from anotiier position per se may be tiic cause of ambiguous 
reading. ^ 

Qoantitatfre Measorements of Dissodatfoo Constants. The 
binding-sitc signature of a zinc finger reveals its differential 
base preferences at a given concentration of DNA. As tiie 
concentration of DNA is altered, one can expect tiie binding 
site signature of any clone to change, being more distinctive 
at I w [DNA], and becoming less so at higher [DNA] as tiie 
Ad of less fav rable sites is approached and further bases 
become acceptable at each position f tiie triplet. Furtiier 
because two base positions are randomly occupied in any ne 
library f oligonucleotides, binding-site signatures are not 
formally able to exclude tiie possibility f context depen- 
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dcncc f r some intcracd ns. Therefore to supplement bind- 
mg-sitc signatures, which are essentially comparative, quan- 
titative determinations f the values of each phage for 
different DNA binding sites are required. After phage display 
sdecti n and binding-site signatures, this is the thiid and 
definitive stage in assessing the spedficity of zinc fingers. 

Examples of such studies presented in Fig. 3 reveal that 
zinc finger phages bind the operators indicated in their 
binding-site signatures with values in the range of lO"" to 
10~* M and can discriminate against closely related binding 
sites by factors greater than an order of magnitude. Indeed, 
Rg. 3 shows such differences in afSnity for binding sites 
which differ in only one out of nine base pairs. Since the zinc 
fingers in our panel were selected from a library by noncom- 
petitive aflRnity purification, there is the possibility that 
fingers which are even more discriminatory can be isolated 
by a competitive selection process. 

Measurements of allow different triplets to be ranked in 
order of preference according to the strength of binding. The 
examples here indicate that the contacts from cither position 
"1 or +3 can contribute to discrimination. Also, the ambi- 
guity in certain binding-site signatures referred to above can 
be shown to have a basis in the equal afi^ty of certain fingers 
for closely related triplets. This is demonstrated by the Ad 
values of the finger containing the amino acid sequence 
RODALTSHER for the triplets TTG and GTC. 

A Code for Zinc Fhiger-ONA Recognltkm. One would 
expect that the versatility of the zinc finger motif will have 
allowed evolution to develop various modes of binding to 
DNA (and even to RNA) which wiU be too diverse to fall 
under the scope of a single code. However, although a code 
may not apply to all zinc finger-DNA interactions, there is 
now convincing evidence that a code applies to a substantial 
subset. This code will fall short of being able to predict 
unfailingly the DNA binding-site preference of any given zinc 
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finger from iu amin acid sequence but may yet be suflS- 
ciently comprehensive to allow the design of zinc fingers with 
specificity for a given DNA sequence. 

Using the selccti n methods of phage display a) and of 
tnnding-site signatures, we find that in the case of Zif268-like 
zinc fingers, DNA recognition involves four fixed principal 
fthrec i»imary and one auxiliary) positions on the a4xelix» 
from which a limited and specific set of amino actd-faase 
contacts result in recognition of a variety of DNA triplets. In 
other words, a code can describe the interactions of zinc 
fiilgcrs with DNA. Toward this code, we can propose amino 
acid-*ase contacts for almost all the entries in a matrix 
relating each base to each position of a triplet (Fig. 2). Where 
there IS overiap, our resulu complement those of Destjariais 
and Beig (10, 11), who have derived similar rules by altwing 
zinc finger specificity, using database-guided mutagenesis 

Combinatorial Use of the Coded Contacts. The individu^ 
base contacts listed in Fig. 2, though part of a code, may n t 
always result in sequence-specific binding to the expected 
base triplet when used in any combination. First, we must be 
aware of the possibiUty that zinc fingers may not be able to 
recognize certain combinations of bases in some triplets by 
use of this code, or even at all. Otherwise, the m^rity of 
inconsistenaes may be accounted for by considering varia- 
Uons in the inclination of the trident reading head of a zinc 
finger with respect to the triplet with which it is inteiactiog. 
It appears that the identity of an amino acid at any one 
o-helical position is attuned to the identity <rfthe residues at 
the other two positions to allow three base contacts to occur 
simultaneously. Therefore, for example, in orxler that Ala 
may pick out thymine in Uie triplet GTG, Arg must not be 
used to recognize guanine from position +6, since tiiis would 
distance the Ala residue too far from the DNA (see for 
example tiie finger containing the amino acid sequence 
RODALTSHER). Second, since die pitch of the o-helix is 3 .6 
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amino acids per turn, positions -1, +3. and +6 arc not an 
mtegral number f turns apart, so that positi n +3 is nearer 
to the DNA than is -1 r +6. Hence, for example, sh rt 
amm acids such as His and Asn, rather than the longer Ar« 
and Gin, arc used f r the recognition of purines in the middle 
position of a triplet. 

As a consequence of these distance effecu, we might say 
that the code is not really '•alphabetic*' (always identical 
aminoadd-base contact) but rather "syUabic" (use of a small 
repertoire of amino acid-base contacts). An alphabetic code 
would involve only four rules, but syllabidty adds an addi- 
Uonal level of complexity, since systematic combinations of 
rules comprise the code. Nevertheless, the recognition of 
each triplet is still best described by acodc of syllables, rather 
than a catalogue of ''logograms'* (idiosyncratic amino acid- 
base contact depending on triplet). 

Conclusions. The syllabic code of interactions with DNA is 
made possible by the versatile framework of the zinc finger* 
this allows an adaptability at the interface with DNA by slight 
changes of orientation, which in turn maintains a stoichiom- 
^ of one coplanar amino acid per base pair in many 
different complexes. Given this mode of interaction between 
amino acids and bases, it is to be expected that recognition 
of guanine and adenine by Arg and Asn/GIn, respectively, is 
an unportant feature of the code; but remarkably, other 
mteractions can be more discriminatory than was anticipated 
(12). Conversely, it is clear that degeneracy can be pro- 
grammed in the zinc fingers in varying degrees, allowing for 
mtricate interactions with different regulatory DNA se- 
quences (7, 13). One can see how this principle makes 
possible the regulation of differential gene expression by a 
lumted set of transcription factors. 

As we have noted, the versatUity of the finger motif will 
most likely allow other modes of binding to DNA. Similariy . 
we must take into account the maUeability of nucleic acids 
such as was observed in ref . 9, where a deformation of thi 
double hchx at a flexible base step aUows a direct contact 
frorn Ser at position +2 of finger 1 to a thymine at the 3' 
posiUon of the cognate triplet. Even in our selections there 
are mstances of fingers whose binding mode is obscure and 
may require structural analyses for clarification. Thus, water 
may be seen to play an important role, for example, where 
short side chains such as those of Asp, Asn. or Ser interact 
with bases from position -1 (14, 15). 

Eventually, it might be possible to develop a number of 
codes descnbing zinc finger binding to DNA, which could 
predict the binding-site preferences of some zinc fingers fi^m 
their ammo acid sequences. The fiinctional amino acids 
selected in this study at positions -1, +3. and. to some 
extent. +6 are very frequently observed at the same positions 
in naturally occurring fingers (e.g.. see figure 4 of ref. 16) 
supporting the existence of coded contacts from these three 
positions. However, the lack of definitive predictive methods 
IS not a serious practical limitation, as current laboratory 
techniques (this paper and refs. 2 and 3) will allow the 
identification of binding sites for a given DNA-binding pro- 
tein Rather, we can apply phage selection and a knowledge 
of the recognition rules to the converse problem, the design 
of proteins to bind predetermined DNA sites. 
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Prospects for the Design of DNA-BfaMttng PMefais. The 
ability to manipulate the sequence specificity f zinc fingers 
imphes that we are on the eve of designing DNA-binding 
protems with desired specificity for api^ications in mcdkine 
and research (11. 17). This is possible because of the modular 
naturcofth zinc finger, by contrast to all other DNA-binding 
mottfs, smcc DNA sites can be recognized by appropriate 
combinauons of independenUy acting fingers linked in tan- 
dem. - . . : 

The coded interactions of zinc fingers with DNA can be 
used to model the specificity of indivklual zinc fiogers de 
novo or, rnore likely, in conjunction with phage display 
selection of suitable candidates. In this way, according to 
requirements, one could modulate Uie affinity for a given 
binding site or even engineer an appropriate degree of indis- 
cnminatton at particular base positions. Moreover, the ad- 
ditive effect of multiply repeated domains offers tiie oppor- 
tumty to bmd specificaUy and tightiy to extended, and hence 
very rare, genomic loci. Thus, zinc finger pixrteins might weU 
be a good alternative to the use of antiscnse nucleic acids in 
suppressing or modifying the action of agiven gene, whether 
normal or mutant. To this end, extra fimctions could be 
mtroduced into these DNA-binding domains by appending 
smtablc natural or synthetic effectors. ^ 
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is distinct rigid body movement with respect to the B pentamer. 
Residues A: I -222 (that is. the Al fragment plus the long N- 
terminal helix of A2) move by a 5" rotation, with residues 
216-222 funaioning as a hinge (Fig. 3o). This type of roution 
is also found in other crystal f rms f LT without bound sugars 
( ur unpublished results), but the remarkable flexibility may be 
important for translocati n f LT and cholera toxin across the 
membrane. 

One unexpected result was the well-defined density for five 
C-terminal residues of the A2 fragment (comprising residues 
A: 196-240), which were poorly defined in the native map at 
2.3- A resolution". In the LT-lactosc complex, residues A2 : 232- 
235 adopt a helical conformation (Fig. 36). Hence the unusual 
A2 fragment consists of (1) a long N-lcrminal helix, interacting 
with the Al enzyme; and (2) an extended chain, comprising 
residues 227-231, protruding through the central pore of the B 
pentamer along the 5-fold axis; and (3) a one-and-a-half turn 
C-terminal helix (Fig. 3c). Temperature factors are relatively 
high for residues 232-235 (on average 45 A' vereus an average 
of 27 A' for the entire A2, and 18 for the B pentamer) and 
hence this helix may be poised for conformational change. The 
C-terminal helix is also found in a higher-resolution native LT 
structure based on 1.95 A dau (T.K.S. et al, manuscript in 
preparation) and therefore seems to be an intrinsic feature of LT. 

The orientation of the bound galactose (Fig. 3a) is such that 
the CI linker atom is in the middle of the 'convoluted* surface 
of the B pentamer, about 8 A horizontally away from the side 
of the pentamer and at least 25 A vertically away from the 
opposite 'flat' or 'A-binding* surface. This distance is too far 
for the remaining sugar residues of the Gmi oligosaccharide to 
span because they extend 25 A at most from the hydrophobic 
part of the membrane^*. Binding with the A subunit facing the 
membrane would not only force the A subunit into the mem- 
brane but also insert the hydrophilic B penUmer into the hydro- 
phobic part of the membrane by at least 8 A, in contradiction 
to biochemical data"*•"*»^ As there is little or no evidence for 
any large conformational change in the B pentamer upon Gmi 
binding , the initial binding of LT or cholera toxin to Gmi- 
conuining membranes most probably occurs with the A subunit 
pointing away from the membrane, when all five ganglioside 
binding sites are occupied. This mode of binding has already 
been proposed'-'** but others have come to different con- 
clusions"-'^". It should be noted that here the C terminus of 
A2 interacts with the membrane (Fig. 3), in good agreement 
with photoaffinity labelling studies'**. The C-terminal sequence 
of A2 (ArgAspGluUu) closely resembles the LysAspGluUu 
sequence in cholera toxin; this KDEL sequence may act as a 
retention signal in the endoplasmic reticulum membrane" and 
could be important for the interaction of both toxins with the 
cell membrane. 

Finally, it has been pointed out to us that the topology of 
monomcric Staphylococcus aureus nuclease" is essentially the 
same as that of the B subunit of LT, although the relative 
onentation of the secondary-structure elements is different (A. 
Muzzin, personal communication). But the position of the galac- 
tose-binding site described here corresponds roughly with the 
active site of the nuclease, which may be of interest from the 
point of view of evolution in that LT, cholera toxin and 
the nuclease are all secreted by bacterial pathogens. □ 
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SeBection of single-strand d DNA 
molecules that bind and inhibit 
human thrombin 

Louis C. BocK Linda C. Griffin, John A. Latham. 
Eric H. Vermaas & John J. Toole* 

Gilead Sctences Inc., 346 Lakeside Drive, Foster City 
California 94404. USA 

Aptamers' are double-stranded DNA or single-straoded R^ 
molecules that biod specific molecular targets. Large random 
generated populations can be enriched in aptamers by in vit 
selection and polymerase chain reaction*'". But so far sing, 
stranded DNA has not been investigated for aptamer properti' 
nor has a target protein been considered that does not Inten 
physiologically with nucleic add. Here we describe the isolati 
of single-stranded DNA aptamers to the protease thrombin of I 
blood coagulation cascade and report binding affinities in the ran 
25-200 nM. Sequence data from 32 thrombin aptamers, select 
from a pool of DNA containing 60 nucleotides of random sequen* 
displayed a highly conserved 14-17-base region. Several of the 
aptamers at nanomolar concentrations inhibited thrombin-cat 
lyscd (ibrin^lot formation in vitro using either purified fibrinog 
or human plasma. 

Wc synthesized a pool of - 10'' 96.mcr oligodcoxyribonuclc 
tides that share I8-nuclcotidc binding sites for polymerase ch£ 
reaction (PGR) primers at their 5' and 3' termini and also cont£ 
60-nuclcotidc randomly generated sequences. Using a 5'-biotir 
latcd primer for one strand, this pool was amplified and rad. 
labelled by PGR. The nonbiotinylated strand was isolated frc 
its complementary strand after application to an avidin-agarc 
column and base denaturation'^ Single-stranded DNA was th 
applied to a concanavalin A (con A)-agarose column to rcmo 
DNA with affinity for con A, and the eluenl applied to hum 
thrombin immobilized on con A. After washing unb und D^ 
from the column, thrombin-DNA complexes were eluted w 
a con A ligand, a-methylmannoside. Fractions containi 
thrombin were identified using a chromogenic substrate ass 
and DNA from those fracti ns was quantitated bcf re Pi 

• To whom oorresponoence thouM be ad*eued. 
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TABLE 1 Sequence specificity of thrombin inhibition 



Clotting time (s) 

Sample Sequence Concentration Punfied fibrinogen Human plasma 

NoDNA - - 25±1 25±1 

aone29 96-mef lOOnM 76±3 50±2 

Unselected ONA 96-n>er 100 nM 26 ±1 26 ±1 

Consensus 15^r GGTTGGTGTGGTTGG 100 nM 169 ±8 43 ±2 

Saambled 15-mef GGTGGTGGTTGTGGT 100 nM 26 ±1 26 ±1 

Consensus 6-mer GGTTGG 20 42 ±2 40 ±2 

Scrambled 6-mer TGGGGT 20 /iM 26 ±1 26 ±1 



DNA was incubated for 1 min at 37 X in either selection buffer (0.2ml: Fig. 1 legend) containing human fibrinogen (Sigma) or fresh human plasma. 
Thrombin (0.1 ml in selection buffer pre-equilibrated to 37 X) was added to give a final concentration of 13 nM thrombin and the indicated concentrations 
of ONA: the final concentration of purified ftorinogen was 2mgmr^. Dotting times were measured using an automated ftorometer. Human Wood was 
obtained by venipuncture, anlicoagulated with the addition of 0.1 volumes 3.8% sodium citrate and fractionated by centrifugation (at 2.000g for 5 min). 
Plasma was decanted and stored (for less than one day) at 4 X. Clotting times are the average of three experiments. 



amplification and repeated selection. The scheme for the selec- 
tion and amplification cycle is shown in Fig. 1. 

Only 0.01% of the input DNA cluted with thrombin during 
the first selection cycle; this percentage increased in subsequent 
rounds of selection to -40% by selection cycle 5. DNA from 
selection cycle 5 was assayed for thrombin binding specificity 
on nitrocellulose fillers. The results showed that a significant 
fracti n (--30%) of aptamer DNA bound to thrombin but not 
to either ovalbumin or human fibrinogen (data not shown). 
Cycle-5 DNA was cloned in order to test the affinity of individual 
aptamers for thrombin, and radiolabelled single-stranded DNA 
from several clones was used to measure dissociation constants 
(/Ca). Several clones gave «=200nM. whereas the original 
pool of DNA showed little affinity for thrombin (data not 
shown). 

We determined the DNA sequence of the 60-nucieotidc ran- 
domly generated region from 32 clones to examine both the 
heterogeneity of the selected population and to identify 
homologous sequences. Sequence analysis showed each of the 
32 clones to be distinct, but there was a striking sequence 
conservation evident in every clone. The hexamer GGTTGG 
was found at a variable location within the 60-nucleotide ran- 
domized region in 31 out of 32 clones. The remaining clone 
contained the sequence TGTTGG. a match of five out of six 
bases. Additionally, in 28 of the 32 clones, a second hexamer 
GGNTGG was located 2-5 nucleotides 5' or 3' from the hexamer 

FKi 1 Scheme for the selection of DNA aptamers to human 
thrombin. 

METHODS. 96-nr)ef DNA was prepared by solid-phase phos- 
phoramicfite chemistry on a Biosearch 8600 synthesizer using 
an ec^imolar mixture of the four bases for the GO-nucteotide 
random portion of the sequence wWch is flanked by defined 
18-nudeotide regions that allow for PGR priming t>y the 
following oligonudeotides: 5'-CGTACGGTCGACGCTAGC-3' and 
5' bioUn-QGATCCGAGCTCCACGTG-3' (biolinylation reagent 
from NEN). The synthetic DNA was purified by P^(SE and 
amplified 100-fold by targe-scale PGR to generate a sequence 

Itt)fary with a complexity >10** in<fivkJual sequences*. DNA ^_ 

from this lil)rary was then amplified and ratfiolabencd by PCR 

under standard conditions in the presence of GO^D [a- 

"PjdNTPs. The resulting bioUnylated (asterisk) douWe- 

stranded DNA was applied to an avldin-agarose (Vector Labs) 

column (>o) equilibrated with O.IM Tris-«a, pH 7.5. O.IM 

Naa. ssDNA was then eluted in 0.15N NaOH (ref. 12). The 

sample was neutralized with acetic add, oortcentrated. ar>d 

the DNA precipitated with ethanol. Followtng cenuifugation, 

ONA was redissotved in selection buffer (20 mM Tris-acetate. 

pH 7.4, 140 mM NaQ. 5mM KCll mM CaCta, 1 mM MgO,) 

and 100 pmol applied to 1 ml concanavaltn A-agarose (Vector 

Labs) equilOyated in selection buffer. The ftovtf-through was 

applied to 6 nmol of human thrombin (Sigma) immobolized on 1 ml con- 

canavalin A-agarose equilOirated in setectton buffer. After washing with ' 

several column volumes of selection buffer. thromUrvDNA complexes were 

eluted by the adcfition of O.IM a-methytmannoside to the selection buffer 

w»rK c^-w^ti/v^c /v%nfainintf thrrimhin wftre Identified usln« 8 chromogcnic 



GGTTGG. The DNA sequence from the region of conservation 
from all 32 clones is presented in Fig. 2. Except for four 
clones that contain a close variation, the sequence 
GGNTGGNj.sGGNTGG is conserved. A compilation of the 
data relating base frequency to position is also shown in addition 
• to the derived consensus sequence. DNA sequencing of clones 
(>20) from the unselcctcd DNA population or from a popula- 
tion of aptamers selected for binding to a different target revealed 
no homology to the thrombin-selccted aptamers (data not 
shown). These results indicate that this consensus sequence is 
responsible, cither wholly or in part, for conferring thrombin 
affinity on the 96-mcr oligodeoxyribonucleotide. 

We next analysed whether the aptamer DNA could inhibit 
the thrombin-catalysed conversion of fibrinogen to fibrin using 
either purified human fibrinogen or adult human plasma. Using 
clone 29 DNA (Fig. 2) and purified fibrinogen, fibrin formation 
was detected at 76 seconds, compared with 26 s with unselcctcd 
DNA or 25 s in the absence of DNA (Table 1). More impress- 
ively, with respect to oligonucleotide size and potency, fibrin 
formation using the 15-mer GGTTGGTGTGGTTGG (con- 
tained in clones 15 and 29) was detected at 169 s. The 6-mer 
GGTTGG also inhibited thrombin, although a higher concentra- 
tion was required. No inhibition was seen with 15-mcr or 6-mcr 
scrambled sequence controls. The aptamers had anti-clotting 
activity in human plasma, although they were less potent (Table 
1). This difference in potency may be attributable to nuclease 

pGEMctoning 



PGR 



^ agaroM 



Synthetic SS-mof 



DNA 




Thrombin- ONA 




Thfombir^-con A-agareso 



Unbound DNA 



pelleted t>y centrifugation, redissotved. and subjected to further cycles of 
amplifcation arxJ selection. After the fifth selection cyde, DNA was amplified 
l>y PCR using the primers 5'-CGTACGGTCX5ACGCTAGC-3' and 5'-TAATAO 
GACTCACTATAGGGATCOGAQCTCCAOGTG-r, and subsequently doned using 
the plasmid pC£M3Z (Promega). Single-stranded S&^mer ONA was then 
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FK5. 2 Thronnbin aptanier sequence homology. DNA from the fifth cycle of 
thrombin selection was amplified by PCR and cloned (see Fig. 1 legend). The 
sequences of the rarxJomly generated 60-mer inserts were determined for 
32 clones using dideoxynudeotide chain termination. A region of homology 
was identified shown in bold upper case. Below is a tabulation of the data 
and the deduced consensus sequence. 



100 




0-1 , — I — " — ' — 

-9 -8 -7 -6 

log (DNA) (M) 

FKx 3 Per cent thrombin inhibition versus ONA concentration. 
METHODS. Selection buffer containing human fibrinogen (2mgn^l'* final) 
and varying concentratior\s of DNA {- 1 nM-150 nM) was incubated for 1 min 
at 37 "C before adding thron^in as described in Table 1 legend. Dotting 
times were measured using an automated ftoronr^eter. The extent of thrombin 
inhibition was then calculated using a thrombin standard, curve generated 
by measuring dotting tinr>e versus thrombin concentration. 



action or to binding of prothrombin or other plasma proteins' I 
present in much higher concentrations than thrombin. When | 
the 6-mer was tested at much higher concentrations it was still J 
as potent, perhaps because plasma nucleases and/or binding ; 
proteins were saturated. Tlius. this protocol enabled us to isolate i 
single-stranded DNA molecules that inhibit catalysis by throm* * 
bin. Moreover, this activity is retained by a shon consensus ,* 
sequence derived from the selected 96-mcrs. 

We next determined the extent of thrombin inhibition with • 
varying DNA concentration and found thai thrombin was 
inhibited by 50% at 25 nM (Fig. 3). This is not a true inhibition 
constant, but it demonstrates the potency of the thrombin 
aptamer. Also, the data indicate that stoichiomciry could be I : | 
and that each bound thrombin is largely, if not completely, 
inhibited. 

Our findings show that a population of -^10'^ molecules of 
96-mcr single-stranded DNA can be scleaed for aptamers that 
bind human thrombin, a protein with no known nucleic acid- 
binding function. As thrombin is a glycoprotein, we have been 
able to use lectin-agarose to immobilize it and a-mcthylmanno- 
side to elute the thrombin-DNA complexes. Initially we bound 
DNA to thrombin that was covalently linked to agarose. 
Denaturing elution with EDTA gave single-stranded DNA with 
affinity for the matrix and hence only a modest enrichment of 
thrombin aptamers. These results suggest that the conditions 
under which aptamers are cliited from a covalently bound target 
arc crucial to the successful isolation of high-affinity aptamers. 
One way to circumvent these problems may be to elute aptamer- 
targct complexes from the matrix. Once isolated from matrix- 
associated oligonucleotides, the' complexes can be fully 
denatured and the aptamers with the highest affinity recovered. 
In view of our success in isolating high-affinity aptamers to 
thrombin, we believe that lectin immobilization may provide an 
aptamer selection technique applicable to a large variety of 
glycoproteins. 

The basic nature of thrombin may also have facilitated the 
isolation of aptamers; apart from the. catalytic site, thrombin 
also contains an anion-binding exositc which binds 
fibrinogen'^"'^ and a binding site for heparin, a polyanion. In 
vitro selection should allow the isolation of aptamers to most 
proteins, although the affinity may be highest for proteins rich 
in basic residues. The strong sequence dependence shown in 
Fig. 3 and in Table I suggests, however, that electrostatic interac- 
tions may be necessary but not sufficient for high-affinity throm- 
bin binding. 

It may be that the affinity of an aptamer for its ligand is 
comparable to that of an antibody for its antigen. The chemical 
nature, size and mode of isolation of aptamers may sometimes 
offer advantages over existing antibody technology. We are at 
present investigating the aptamer-binding site on thrombin and 
analysing the binding sequences of individual aptamers in an 
effort to understand the base relationships that mediate 
binding and inhibition, our long-term interest being to develop 
diagnostics and therapeutic agents. D 
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TRANSDUCTION INTERMEDIATES 

Statement as to Federally Sponsored Research 
This research has been sponsored in part by grant number AI27849 
from the National Institutes of Health. The U.S. government has certain rights 
to this invention. 

Background of the Invention 
The invention relates to methods of identifying polypeptides and 
compounds which regulate gene expression. 

Pharmaceuticals have historically been developed by testing libraries 
of up to several thousand compounds in laboratory animals, usually one 
compound at a time. The slow pace of the process, and its unsuitability for 
screening large numbers of diverse compounds, led to the development of 
approaches based on assays that can be completed quickly and ex vivo. With 
these approaches, the pharmaceutical drug discovery process has evolved into a 
catenation of several, sometimes partially concurrent, phases. 

In the first phase, target identification and validation, a candidate 
target for a potential drug is identified by various means. These means may 
include hypotheses formed from the study of the pathophysiology of disease in 
humans or experimental animals, analysis of candidate signal transduction 
pathways in vitro, natural experiments such as genetic disorders of humans or 
other animals, results from targeted or random gene disruptions in model 
organisms, or disclosures by competitors. The projected consequences of a 
hypothetical drug interacting with its intended target to stimulate, block or 
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modulate the target activity is then tested. If the outcome of testing suggests 
that a drug affecting the target would have the desired physiological 
consequences in an intact organism, the target is said to be validated. Neither 
target identification nor validation are formally required, and, indeed, the 
historical process had neither of these phases. 

Present models for pharmaceutical industry productivity' emphasize 
the importance of efficiently identifying large numbers of validated targets and 
of developing rapid screens for their activity'. The impetus for this model is the 
widespread belief that targets cannot be predicted effectively, so that a large 
a 10 number of targets must be evaluated to develop a small number of drugs, 
p Hence, there remains an important need across the industry to accelerate the 

^ pace of target discovery and validation. 

Once the target has been identified and validated, an assay must be 
created to allow large libraries of synthesized or natural compounds to be tested 
315 for their ability to interact with the target. WTien a compound is identified 
□ which acts on the target specifically, it is usually said to be a hit. The definition 

of a hit generally encompasses compounds that pass various secondary tests to 
assure that their activity' is specific to the target of interest. From the various 
hits that are generated, one or more structures are chosen to represent the 
20 starting point for a program of systematic modification of the chemical 

structure. These stnictures are called lead compounds, and they are frequently 
selected from hits on the basis of their compatibility with directed synthesis 
programs, expected toxicity, or expected absorption, distribution, metabolism 
or excretion characteristics. Again, the process encompasses a wide range of 
practices and although they are usually distinguished, hits and lead compounds 
are sometimes referred to in ways that make them appear equi\'alent. 
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Summarv of the Invention 
In general, the invention provides a novel, rapid method for 
identifying members of selected signal transduction pathways v^ hich are targets 
for drug design. 

In a first aspect, the invention features a method for identifying a 
polypeptide which increases gene expression from a promoter, including (a) 
contacting a library of polypeptides with a cell that expresses a recombinant 
anti-cell death gene and that contains a reponer gene operably linked to a 
promoter, whereby expression of the reporter gene is increased if the library 
includes a polypeptide which increases gene expression from the promoter; (b) 
determining whether the reporter gene expression is increased in the cell as a 
result of contact with the polypeptide librar}'; and (c) if reporter gene 
expression is increased, identifying a polypeptide of the library which increases 
reporter gene expression. 

In a second aspect, the invention features a method for identifying a 
polypeptide which decreases gene expression from a promoter, including (a) 
contacting a library of polypeptides with a cell that expresses a recombinant 
anti-cell death gene and that contains a reponer gene operably linked to the 
promoter, whereby expression of the reporter gene is decreased if the library 
includes a polypeptide which decreases gene expression from the promoter; 

(b) determining whether reporter gene expression is decreased in the 
cell as a result of contact with the polypeptide library; and 

(c) if reporter gene expression is decreased, identifying a polypeptide 
which decreases reponer gene expression. 

In a third aspect, the invention feamres a method for identifying a 
polypeptide which modulates activation of a transcription factor activation 
domain, including (a) contacting a library of polypeptides with a cell that 
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expresses a recombinant anti-cell death gene and a chimeric transcription factor 
consisting of a yeast or bacterial DNA binding domain fused to a mammalian 
transcription factor activation domain, and that contains a reporter gene 
operably linked to a promoter consisting of a basal promoter and binding sites 
for the DNA binding domain, whereby expression of the reporter gene is 
altered if the library comprises a polypeptide which modulates acti\ ation of the 
transcription factor activation domain; (b) determining whether reporter gene 
expression is altered in the cell as a result of contact with the polypeptide 
library; and (c) if reporter gene expression is altered, identifying a polypeptide 
which modulates reporter gene expression. 

In a fourth aspect, the invention features a method for identifying a 
compound which modulates gene expression from a promoter, including (a) 
contacting a library of compounds with a cell that expresses a recombinant anti- 
cell death gene and that contains a reporter gene operably linked to the 
promoter, whereby expression of the reporter gene is altered if the library 
includes a compound which modulates gene expression from the promoter; ' 

(b) determining whether reporter gene expression in the cell is 
altered as a result of contact with the compound library; and 

(c) if reporter gene expression is altered, identifying a compound 
from the library which modulates reporter gene expression. 

In a fifth aspect, the invention features a method for identifying a 
compound which decreases gene expression, including (a) contacting a library 
of compounds with a cell expressing (i) a recombinant anti-cell death gene; (ii) 
a second gene encoding a polypeptide; and (iii) a reporter gene that would have 
decreased expression if the function of the polypeptide was blocked; (b) 
determining whether expression of the reporter gene is decreased as a result of 
contact with the compound library; and (c) if expression of the reporter gene is 
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decreased, identifying a compound from the librarv' which decreases reporter 
gene expression. 

In preferred embodiments of the first, second, and third aspects, a 
library of DNA molecules encoding the library of polypeptides are expressed in 
a cell. The cell can be the same cell that expresses a recombinant anti-cell 
death gene and that contains a reporter gene operably linked to a promoter (in 
which case the polypeptide is produced by the same cell that expresses a 
recombinant anti-cell death gene and that contains a reporter gene operably 
linked to a promoter), or the polypeptide can be produced by a cell other than 
the cell that expresses a recombinant anti-cell death gene and that includes a 
reporter gene operably linked to a promoter; preferably the DNA molecules are 
expressed from a high-efficienc}' expression system. 

In other preferred embodiments of the first, second, and third aspect, 
the library of DNA molecules is introduced to the cell by transfection, and the 
mean number of DNA molecules introduced by transfection to the cell is at 
least 25. Preferably, the mean number of DNA molecules introduced by 
transfection into the cell is at least 100 or even 500. 

In still other preferred embodiments of the first, second, and third 
aspect, the polypeptide is selected from the group consisting of an extracellular 
ligand, a cell surface receptor, and a signal transduction intermediate, and the 
DNA molecules are expressed from a high-efficiency expression system. 

In preferred embodiments of the first, second, third, fourth, and fifth 
aspects, step (c) includes (i) dividing the library into two or more libraries with 
less complexity; and (ii) repeating steps (a) and (b) until a polypeptide which 
activates reporter gene expression is identified; the promoter can derived from 
a mammal and the library of poU^peptides can include polypeptides derived 
from a bacterium or a vims. 
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In other preferred embodiments of the first, second, third, fourth, and 
fifth aspects, the promoter is a heterologous promoter, the reporter gene is GFP, 
the anti-cell death gene is selected from the group consisting of bcl family 
members, lAP family members, and crmA, and the cell is selected fi-om the 
group consisting of CHO, CD-I, Cos, 293, HeLa, BHK, or L cells. 

In another aspect, the invention features a method for determining 
whether a compound modulates NF-kB biological activity, the method 
including the steps of: (a) providing a cell expressing a BCMA polj'peptide; (b) 
contacting the cell with a candidate compound; and (c) measuring the level of 
expression of the BCMA polypeptide in the cell, wherein a change in the level 
of expression of the BCMA polypeptide in the cell, relative to a cell not 
contacted with the candidate compound, identifies the candidate compound as a 
compound that modulates NF-kB activity. 

In still another aspect, the invention features a method for 
determining whether a compound modulates BCMA biological activity, the 
method including the steps of: (a) providing a BCMA polypeptide; (b) 
contacting the polypeptide with a candidate compound; and c) measuring the 
level of biological activity of the BCMA pol>TDeptide, wherein a change in the 
level of biological activity of the BCMA pohTpeptide, relative to a polypeptide 
not contacted with the candidate compound, identifies the candidate compound 
as a compound that modulates BCMA activity. The BCMA polypeptide can be 
in a cell or in a cell-fi-ee system. A preferred BCMA biological activity is the 
modulation of NF-kB biological activity (e.g., the modulation of transcription 
by NF-kB) or NF-kB expression. Preferably, the BCMA polypeptide includes 
a polypeptide sequence having substantial identity to amino acids 98 to 164 of 
human BCMA (SEQ ID NO: 1) or amino acids 97-163 of mouse BCMA (SEQ 
ID NO: 2). 
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In yet another aspect, the invention features a method for 
determining whether a compound modulates NF-kB activity, the method 
including the steps of: (a) providing a BCMA polypeptide; (b) contacting the 
polypeptide with a candidate compound; and (c) detecting the binding of the 
candidate compound to the polypeptide, wherein a candidate compound that 
binds to the polypeptide is a compound that modulates NF-kB biological 
activity. The BCMA pol>T3eptide can be in a cell or in a cell-free system. 
Preferably, the BCMA polypeptide includes a polypeptide sequence having 
substantial identity to amino acids 98 to 164 of human BCMA (SEQ ID NO: 1) 
or amino acids 97-163 of mouse BCMA (SEQ ID NO: 2). 

In another aspect, the invention features a substantially purified 
polypeptide consisting of a BCMA polypeptide molecule lacking a BCMA 
extracellular domain. Preferably, the BCMA polypeptide includes a 
polypeptide sequence having substantial identity to amino acids 98 to 164 of 
human BCMA (SEQ ID NO: 1) or amino acids 97-163 of mouse BCMA (SEQ 
ID NO: 2). Other preferred polypeptides are BCMA polypeptides that 
modulate NF-kB activity. 

In yet another aspect, the invention features an NF-kB modulator 
including a BCMA polypeptide covalently linked to a heterologous compound. 
Preferably, the BCMA polypeptide includes a polypeptide sequence having 
substantial identity to amino acids 98 to 1 64 of human BCMA (SEQ ID NO: 1) 
or amino acids 97-163 of mouse BCMA (SEQ ID NO: 2). Preferably, the 
modulator modulates NT-kB biological activity or expression. 

The invention also features methods for activating NF-kB activity' in 
a cell by contacting the cell with a recombinant BCMA polypeptide having NF- 
kB activating activity or by contacting the cell with a recombinant nucleic acid 
molecule encoding a BCMA polypeptide having NF-kB activating activity. 



wo 00/50633 



PCT/USOO/04925 



-8- 

In related aspects, the invention features the use of BCMA 
polypeptides and nucleic acids for preparing pharmaceutical compositions for 
treating cancer or apoptosis. 

By a "reporter gene" is meant a DNA or RNA sequence which 
encodes a reporter protein that is capable of being readily detected either inside 
or outside a cell. 

The reporter gene is operably linked to a promoter which shows low 
spontaneous activation, so that the activity of the reporter protein in the 
presence of the activating polypeptide be at least tv^'o standard deviations 
greater than the activity in its absence. Various methods of increasing the 
sensitivity of reporter genes are known in the art, including: deletion from 
natural genes of inhibitory sequences, which may be found both within and 
external to the transcribed portion and which may affect mRNA formation, 
stability, or translational efficacy; addition of efficiently utilized introns to 
increase the rate of formation of mature mRNA; multimerization of upstream 
acti\ ation regions or binding sites for known transcriptional activators; 
increasing the copy number of the reporter gene; and protection of the activity 
of the reporter gene from adventitious stimulatory or inhibitory activity in 
flanking DNA by inserting the reporter gene between matrix attachment 
regions or chromatin insulator sequences. 

Many different, types of reporter proteins are known in the art. They 
frequently comprise proteins not nonmally found, or present in minor amounts, 
m some cells; they include enzymes that detoxify antimicrobial agents, such as 
aminoglycoside or aminocyclitol phosphotransferases or acetyltransferases, 
beta-lactamases or chloramphenicol acetyltransferase; enzymes of diverse 
origin that catalyze chromogenic, fluorogenic, or chemiluminescent reactions in 
the presence of exogenous substrates, such as beta-galactosidase, beta- 
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glucuronidase, alkaline phosphatase, catechol 2,3-dioxygenase, or various 
peroxidases; enzymes that catalyze photoreactions, such as bacterial or firefly 
luciferases; enzymes, like glycosyl transferases, that generate nonproteinaceous 
structures easily detected by antibodies, lectins, or cognate binding proteins; 
proteins easily monitored upon cell surface expression or secretion such as 
surface or secreted antigens for which corresponding antibodies or recognition 
proteins are known: and proteins which catalyze the synthesis of, or 
stoichiometrically embody, fluorescent structures, without exogenous 
substrates, such as the jellyfish fluorescent proteins (e.g., GFP). 

By "operably linked" is meant that the gene for the reporter protein is 
positioned adjacent to a promoter which directs transcription of the gene and, 
ultimately, facilitates expression of the reporter protein. 

By "promoter" is meant any minimal nucleic acid sequence 
sufficient to direct transcription of the reporter gene. The promoter is one 
which is activated by binding to a polypeptide. Examples of promoters useful 
in the invention are promoters which are normally linked to genes which are 
expressed when a cell is in a pathologic or disease state (e.g., cancer, 
inflammation, or due to bacterial or viral infection), and the protein products of 
which are directly or indirectly responsible for this state. Suitable promoters 
include, but are not limited to, the NF-kB promoter, the interleukin-2 promoter, 
and the HIV-1 long terminal repeat promoter. 

A library of DNA molecules refers to a set of DNA molecules, each 
in a DNA expression vector. Preferably, the DNA expression vector displays 
high efficiency such that the level of expression is high. While in most cases, 
the library includes DNA molecules encoding tens, hundreds, or even 
thousands of different polypeptides, DNA molecules in a library can also 
encode only one polypeptide (for example, during the final steps of a sib 



wo 00/50633 



PCT/USOO/04925 



-10- 

sclection). A library with hundreds of different DNA molecules is considered 
to have greater "complexity" than a library with five different DNA molecules. 
Like DNA libraries, compound libraries can have different degrees of 
complexity. One feature of the invention is a method which allows for 
following an activity of interest through the sequential screening of libraries 
with less and less complexity. 

By "candidate compound" is meant a chemical, be it naturally- 
occurring or artificial, that is surveyed for its ability to modulate BCMA or NF- 
kB biological activity. Candidate compounds may include, for example, 
peptides, polypeptides, antibodies (and fragments thereof), synthesized organic 
molecules, naturally occurring organic molecules, nucleic acid molecules, and 
components or derivatives thereof The candidate compounds may be screened 
usmg any of the methods described herein using an animal, a cell, a lysate or 
extract derived from a cell, or a molecule derived from a cell. The measuring 
may be, for example, for the purpose of detecting altered gene expression, 
altered RNA stability, altered protein stability', altered protein levels, altered 
protein phosphorylation, or altered protein biological activity. The means for 
measuring may include, for example, antibody labeling, immunoprecipitation, 
phosphorylation assays, and methods known to those skilled in the art for 
detecting nucleic acids. 

By "modulating" is meant changing, either by decrease or increase. 

By "BCMA polypeptide" is meant a polypeptide that has substantial 
identity to human or mouse BCMA as shown in Figs. 7A and 7B, respectively, 
over a region of t\vent\' consecutive amino acids and has a BCMA biological 
activity'. 

By "substantial identity" is meant that two polypeptide sequences, 
when optimally aligned, such as by the GAP or BESTFIT programs using 
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default gap weights, share at least 80% sequence identity, more preferably at 
least 90% sequence identity, and most preferably at least 95% or even 99% 
sequence identity or more. 

By "heterologous compound" is meant a polypeptide, chemical, or 
detectable label (e.g., biotin) that is not naturally associated, in this case, with 
the polypeptide. 

By "BCMA biological activity" is meant BCMA-mediated 
modulation of NF-kB biological activity, or expression or the binding of an 
antibody that specifically binds a BCMA polypeptide. 

By ''NF-kB biological activity" is meant any function performed by 
activated NF-kB. 

The invention features a method of screening DNA and compound 
libraries for their ability to modulate reporter gene expression in a cell which is 
expressing a recombinant anti-cell death gene. The expression of this gene 
allows the cell to survive in conditions which would otherwise lead to its death 
and, as a result, failure of the screening procedure. 

This method is suitable for identifying polypeptides that modulate 
transcription fronT^sglgcted prom oter. These polypeptides are, in turn, targets 
for dioigs. The method is also suitable for identifying compounds which either 
mimic or block function of a polypeptide which itself modulates transcription 
from a selected promoter. 

Hence, the invention features a new, efficient multistep method for 
identifying lead compounds which modulate expression from a promoter for 
development of pharmaceutical compounds. 

The invention also features a new NF-kB activator, BCMA. As KF- 
kB is involved in numerous cellular processes and disease states, BCMA is 
useful for the treatment of disease. BCMA is also useful for the identification 
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of 



„, compounds *a, modulate its expression or btological activ,t> . These 
compounds may be developed as drugs, or used as lead compounds for the 

purpose of identifying drugs. 

other feanrres and advantages of the invention will be apparent from 

the following detailed description, the drawings, and the claims. 

T^j^-fp.. Option "f'])' nrawings 
Fig 1 is a schematic illustration of a reporter cell system for rapid 
Identification of cDNAs encoding polypeptides of interest. A cDNA libra,, is 
„ansfec,ed into a cell which includes a reponer gene (e.g., GFP or Thy-l) 
operably linked to a promoter (,n this case the NF-KB promoter). If the 
polypeptide expressed by one of the plasmids in the reporter cell is capable of 
Inducing expresston from the NF-kB promoter, the reporter gene .s expressed, 
and the detectable marker protein is produced. 

Fio ■> .s a schematic illustration of a cDNA library being d.v,ded 
,nto smaller Lanes. Each of the hbraries is then transfected into reporter 
cells Libraries which induce reporter gene expression are then further dtv.ded 
until the cDN A(s) encoding the polypeptide which induces reporter gene 

expression are isolated. 

Fi» 3 is a schematic- illustration of three stages of enrichment for an 

intracellular s.gnal transduction intermediate. The ratios represent the 
occutrence of the cDNA encoding the reponer gene-inducmg pol>pep«de 
compared to the total cDNAs. Note that, during transfection, each cell recetves 

more than one plasmid. 
5 F,g 4 is a schematic tllustration showing how secreted factor can be 

d,s„ngu,shed from intracellular stgnal transduction tntermediates such as 
receptors or kinases. Soluble l.gands diffirse through the medium ,0 engage 
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feced cells wWch can lead to reporter gene expression m 

Mils- ■ f.,«,ved clones. N= elements, 

^- ^i. a schematic illustration ot arrayed Clone 

F,g. . IS a schem „ansfections. In 

pooled intoN columns and>^ rows, canbe assay 

1 provided 4 rows and 4 columns were positive. 
„e example proMded, ^ i„,,„„tions unambiguously. 

l.fonoUninde.edU^^^ 

eledinP..ltoPiS..«^^^^ 
n^ethod is similar to the one piasmids are then 

"Tt::— tic— , 

Fig. 7A IS a sLiic 0099?^^ The putative 

.r.nRank accession number Q0222 J), m p 
5 polypeptide sequence (GenBankac 

^^^^^ 

,r,„Ranlc accession number AAC237OT). 
polypeptide sequence (GenBankac ,,„^e intracellular 

P„. s is a schematic illustration showing ^ 
RrMA was capable of activating NF-1J3. A s 
JO domain of human BCM.Awa ^ ,,,„,„,e, IgG Fc, a CD7 

proteins were constructed usmg the CD ^^^^ ^^^^ , ^ 

„ansmembrane domain, and ,,1 64; AC40- 

as follows: ,g7bcma-am.no adds 78-18 ^^^^ ^^^^^ 

amino acids 78-144; AC60-amino ™- ^ 3,.,, 138-184; 

OR 1R4- AN40--amino acids llb-ls^, ^ 
25 -amino acids 98-1 84, z^-^h 

A>^80-amino acids 158-184. 
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Detailed DescripTjnp 
The present invention provides a method for the rapid identification 
of molecules involved in regulation of specific signal transduction pathways. It 
can be used both as a target identification tool and as a rapid assay for drug 
effectiveness, and as such can significantly reduce the amount of time needed 
to go from target to hit. 

The method features an expression cloning approach that identifies 
polypeptides that have the ability to activate reporter genes (Fig. 1 ). Such 
polypeptides are called here activating polypeptides. Expression cloning is a 
technique which identifies polypeptides solely on the basis of their ability to 
generate an observable activity of interest. In order for that activity to be 
identified, the polypeptide must generally be expressed, either in vivo or /« 
vitro, and a suitably sensitive assay must be available to detect the activity after 
the polypeptide has been expressed. In the present method, it is preferred that 
the activity that is detected is dependent on the output of a reporter gene (Fic. 
1)- ■ ^ 

Although the use of expression cloning in conjunction with reporter 
genes is not new, existing methods of application have not resulted in highly 
efficient systems for the identification of large numbers of new molecules. The 
present approach is a high throughput system for the identification of cDNA 
clones encoding polypeptides that induce reporter gene activity. Some features 
of the high throughput system of the present invention are: (1 ) a very high 
efficiency cDNA expression plasmid; (2) an easily detected reporter molecule; / 
(3) feaujres to prevent the death of the reponer cell due to toxicity of expressed 
genes; and (4) a method for the introduction of large numbers of plasmids into 
multiple cells. 

Expression cloning can be carried out by either of two general 
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paradigms, positive selection, or sib selection, also known as pool division. By 
positive selection is meant a method for the enrichment of cells, viruses or 
genetically linked assemblies of proteins and the nucleic acids that encode them 
by physically separating the cells or genetically linked assemblies from the 
much larger collection of cells or assemblies that encode molecules of little 
interest. Because the nucleic acids that encode the polypeptides of interest are 
physically linked to the cells, viruses, or other assemblies that they comprise, 
the nucleic acid is physically purified by positive selection and the process can 
be repeated until a single nucleic acid is found that encodes the activity of 
interest. It is desirable in positive selection schemes to develop methods for 
association of a single nucleic acid with a single protein assembly or cell. Thus 
in positive selection it is desirable to identify and use methods for the 
introduction of nucleic acids into cells that result in a single nucleic acid 
species per cell. 

The method of the invention uses sib selection, a process of 
screening in which pools of molecules that are derived from cognate libraries of 
nucleic acids are assayed for activity, and positive libraries are detected by 
some signal (e.g., expression of a reporter gene). The nucleic acids that make 
up the library are then separated into libraries with less complexity, which are 
then reassayed and redivided until a single nucleic acid is found that encodes 
the polypeptide with the desired activity (Figs. 2, 3, and 4) Because sib 
selection depends on the detection of the activity of aggregates of nucleic acids, 
it is often advantageous to use methods for the transfection or introduction of 
nucleic acids into cells that result in a large number of nucleic acid species per 
cell. Sib selection schemes, like positive selection methods, can also be carried 
out entirely in vitro. 
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Features of the Invention 
Transfeciion Method 

For sib^sejectioryo bean e fficient met hod of cloning genes, 



according to the^nvention, it is important that a large number of DNA molecule 



are introduced into each celT Many methods of introducing DNA molecules 
are known in the art, including microinjection, complexation with positively 
charged synthetic polycations such as DEAE dextran, polybrene, polylysine, or 
polyarginine, complexation with histones and other basic proteins, 
complexation with cationic lipids or related amphipathic molecules, 
condensation with polyethylene glycol or polyhydroxybutyrate, coprecipitation 
with calcium phosphate, electroporation, scrape loading or partial rupture, and 
fusion with bacterial or microbial spheroplasts. Preferred among these are 
methods that can be easily carried out in parallel and that result in the co- 
introduction into the reporter cell of multiple nucleic acids that encode different 
species. Moreover, the method of transfection will preferably provide, on 
average, at least 25 DNA molecules per cell. More preferably, the mean 
number will be at least 1 00 or even 500 DNA molecules per cell. The methods 
which achieve the preferred results include calcium phosphate coprecipitation, 
complexation with polycations or cationic lipids, and condensation with 
20 uncharged polymers such as polyethylene glycol. 

The libraries of nucleic acid molecules encoding potentially 
activating polypeptides can also be created in a biologically active assembly, 
such as a virus or viral transducing particle, which is capable of introducing 
itself into the reporter cell directly. In such a case a cDNA library is prepared 
in the viral vector, and libraries of active vims or transducing particles are 
applied to the reporter cell. In general the reporter cell will have been 
pre\ iously engineered to contain a reporter gene, but the reporter gene may also 
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be introduced concurrently with the activating nucleic acid. 

High Efficiency cDNA Expression Vector- 
To detect ectopic activation of a signal transduction pathway it is 
desirable to greatly overexpress proteins that act as signal transduction 
intermediates. 

The invention uses a high efficiency cDNA expression system to 
produce proteins in the reporter cell. Such a system is provided by the use of a 
very strong promoter, such as, for example, the elongation factor 1 alpha (EF- 
1 a) promoter; 3* untranslated region (3* UTR) and polyadenylation consensus 
(poly(A)) sequences from the human growth hormone gene; and the human 
IgGl hinge-CH2 intron. Other strong promoter and nonpromoter elements are 
known in the art (for example, the murine or human cytomegalovims 
immediate early gene promoters, globin introns, and 3' UTRypoly(A) 
sequences). 

Cell Death Inhibition 

Another important feature of the present system is a method to 
prevent the death of cells overexpressing proteins. Such a method is important 
for two reasons: signal transduction intermediates themselves can lead to cell 
death if they are expressed at high levels, and, in addition, if libraries of nucleic 
acids are transfected, the presence of even a low frequency of nucleic acids 
encoding toxic proteins can interfere with detection of the desired signal. The 
latter effect can be predicted to have greater impact as the size of the librar>' 
increases. 

There are several known methods to prevent the demise of cells 
undergoing programmed cell death, or apoptosis. Both viral and cellular 
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antagonists of cell death are known, and among them are species that act 
upstream in the apoptosis pathway, or at multiple levels. Examples of the 
former include dominant negative forms of cell death proteins, such as FADD 
or TRADD, and cellular inhibitors of apoptosis, such as members of the viral or 
cellular lAP family. Examples of proteins that act at multiple levels, or 
relatively downstream, include the orthopoxvirus CrmA and baculovirus p35 
proteins, members of the cellular Bel family, and both peptide and nonpeptide 
inhibitors of caspases, the cysteine proteases which act in a zymogen cascade to 
generate the death program. For the prevention of apoptosis in expression 
cloning, combinations of broad spectmm antiapoptotic proteins v^'ith different 
mechanisms of action are preferred, such as CrmA and Bcl-xL. 

Examples 

Identification of gene products leading to the activation of the transcription 
factor NF-kB 

We established a reporter cell line by stably transfecting the 
commercially-available human embryonic kidney cell line, 293 EBNA, with 
two constructs: a reporter construct consisting of NF-kB promoter elements 
upstream of the green fluorescent protein (GFP); and a eukaryotic expression 
construct that expresses two anti-cell death molecules, CrmA and Bcl-xL. The 
former functions as a readout for the presence of signals that activate the NF- 
kB signal transduction pathway, whereas the latter prevents these cells from 
undergoing programmed cell death in the event that pro-apoptotic signals are 
present. The expression in the reporter cell line of KF-kB-activating 
molecules, including receptors (e.g. tumor necrosis factor-a receptor ]), ligands 
(e.g. tumor necrosis factor-a) and intracellular signaling proteins (e.g. RIP), 
results in the robust production of GFP. The level of GFP production, as a 
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measure of NF-kB activation, can be assayed using a fluorescent microscope or 
flow cytometry. High level expression of these various NF-kB-activating 
molecules was achieved by placing their cDNAs under the control of the very 
strong EF- 1 a promoter present in the commercially available plasmid PEAKS 
(Edge Biosystems). Thus, one can transfect into these cells cDNA libraries and 
identify individual cDNAs which are capable of NF-kB activation. 

A cDNA library, prepared from activated human T cell mRNA and 
cloned into the PEAKS vector, was subdivided into smaller Hbraries of 
approximately 500 cDNA clones each, and DNA was prepared from each 
library. The DNA from the libraries were transfected into the reporter cell line 
by calcium phosphate precipitation following published protocols, e.g., 
Ausubel et al. et al., 1997, Current Protocols in Molecular Biology^, Wiley 
Interscience. Cells were allowed to continue to grow for 48 hours and assayed 
for GFP production. cDNA libraries that gave a positive GFP signal, as 
defined by the appearance of at least 0.1% of bright cells by fluorescence 
microscopy, were screened further by sub-dividing into libraries of 50 cDNAs 
each (Fig. 2). DNA from each library was prepared, transfected into the 
reporter cell line and assayed for GFP production. The process was repeated 
until a single cDNA clone that induced a positive GFP readout was obtained. 
The screening procedure has resulted in the cloning of DNAs encoding known 
NF-kB-activating molecules such as surface receptors (DR3, FAS), soluble 
ligands (interleukin-1 , TRAIL, CD40 ligand), intracellular signaling molecules 
(small molecular weight GTPase rho). Also identified were a previously 
known molecule having no known fijnction (BCMA) and novel molecules. 

The inclusion of two anti-cell death genes in the transfected cells are 
likely to have aided in the expression cloning of at least some of the above- 
mentioned DNAs. It has been established that expression of either FAS or DR3 
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in a cell would induce apoptosis in that cell. The fact that each was retrieved 
substantiates the improved nature of the expression cloning method of the 
present invention. Similarly, as the method of the present invention preferably 
employs high copy number, it is likely that expression of anti-cell death genes 
allows for cells that otherwise would have died, due to a high number of 
inserts, to survive. 

The method is more rapidly and efficiently applied by using indexed 
arrays of bacterial cultures, in which each culture is derived from a single 
colony, and, thus, represents an independently derived cDNA expression 
plasmid (Fig. 5). By pooling small amounts of the cultures from rows and 
columns of large arrays and preparing DNA from each of the libraries, it is 
=* possible to assess the relative activity of every row and every column, thereby 

5 reducing labor dramatically (Fig. 6). For example if there are 90,000 

2 individual/clones arrayed in a 300 by 300 matrix, the 300 rows and 300 

columns can be transfected to determine all of the rows and all of the columns 
that bear positive clones. Hence the array can be indexed in only 600 DNA 
preparations and transfections. If there is only one positive culmre, the job is 
complete. If there are multiple positive cultures, then in the worst case, each 
row and each column will have only one positive culture, and if there are x 
20 positives, then there are x- possible intersections. As long as x is a small 

number, though, the work involved is quite modest. For example if x were 10, 
the entire array of 90,000 clones could be screened in only 700 DNA 
preparations and transfections. 

Variations of the experimental approach outlined above are 
25 applicable depending on the particular system that is to be examined. The 

reporter system and cell lines can be adapted to the promoter to be investigated. 
For instance, the NF-kB promoter in the GFP reporter construct can be replaced 
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by the inter! eukin-2 promoter, introduced into a T lymphocyte cell line and 
screened for molecules that regulate the signal transduction pathway leading to 
the expression of interleukin-2 in T lymphocytes. Other promoters can be 
drawn from viral sources, such as the HIV-1 long terminal repeat (LTR) 
promoter, or other inducible promoters of viral origin. The calcium phosphate 
transfection procedure can be changed to another method more suitable for the 
particular reporter cell type. The alternative transfection protocols could be 
based on electroporation, cationic lipids, DEAE-dextran, spheroblasts fusion or 
viral-mediated delivery. 

A variation to the method described above is to screen for molecules 
that will turn off expression from a promoter. For instance, the NF-kB reporter 
in the reporter cell line is activated in response to interleukin-l stimulation. By 
including interleukin-l in the culture medium, one can then search for 
molecules that will inhibit the interleukin-l dependent expression from the NF- 
kB promoter. Depending on the cDNA library that is used, these molecules can 
be either ones that are naturally negative regulatory or they can be mutant 
versions that behave in a dominant inhibitory manner. Similarly, screens can 
be conducted with a combinatorial library to look for small pharmacological I 
molecules that will negatively interfere with the pathway (see below). 

The screening methods described above are well-suited for screening 
for genes from one organism that interact with a pathway in another organism. 
A prime example is screening the expressed genomes of viruses, bacteria or 
other pathogens for genes that, when expressed, might interact with the NF-kB 
pathway, This method, utilized by these pathogens to alter the immune 
response to their advantage, can identify potential targets for pharmacological 
inten'entions. 

The reporter cell may also be provided with proteins which increase 
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the efficiency of the reponer gene. These proteins arc usually introduced by 
transfection of an expression plasmid that encodes them. Proteins which 
increase the efficiency of the reporter gene may provide essential elements of a 
signaling pathway which are not otherwise present in the reporter cell, or may 
consist of artificial proteins that act to integrate, amplify, or selectively respond 
to signals from the pathway of interest. Of special relevance are artificial 
transcriptional activators which consist of a DNA binding element that interacts 
specifically with cognate binding sites in an artificial promoter and a pathway- 
specific transcriptional activator element that responds to activation of the 
pathway with a change in transcriptional activity. Such artificial activators are 
known in the art as fusions between bacterial or yeast DNA binding proteins 
and mammalian transcription factor activation domains. Such activation 
domains provides sites for protein binding, ligand-activated conformational 
change or post-translational modification, that increase the transcription- 
promoting capabilities of the artificial activator. Examples of artificial 
activators that are commercially available or known in the art include LexA and 
Gal4 f\isions with c-Jun, Elkl, CREB, c-Fos. ATF2, CHOP, and members of 
the nuclear hormone superfamily. 

In one example, a cell which contains (i) a recombinant anti-cell 
death gene, (ii) a chimeric transcription factor consisting of the Gal4 DNA 
binding domain fused to the c-Jun activation domain, and (iii) DNA encodins 
GPP operably linked to regulatory sequence consisting of a basal promoter and 
Gal4 binding sites is used to identify polypeptides or compounds which 
modulate c-Jun activation. If, for example, expression of a polypeptide leads to 
c-Jun phosphorylation (i.e., activation), then increased GFP expression would 
result. 
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BCMA is an activator of NF-kB 

Using the methods described herein, we identified BC\4A as an NF- 
kB activator. BCMA was discovered by molecular analysis of a i(4;16) 
translocation characteristic of a human T cell lymphoma (Laabi et al., EMBO J. 
5 11: 3897-3904, 1992), and its function was not previously known. 

Based on the present discovery, BCMA is a target for drug discovery 
or rational drug design. For example, a compound that modulates BCMA 
^ expression or biological activity will also modulate NF-kB biological activity. 

Accordingly, the invention features methods and reagents for the identification 
1=4 10 of NF-kB modulating compounds. 

BCMA polypeptides or nucleic acid molecules are also useful for the 
\q treatment of diseases associated with insufficient or inappropriate NF-kB 

biological activity or expression. BCMA polypeptides or nucleic acid 
1^ molecules are administered to a patient using an appropriate delivery vehicle, 

^^15 as known in the art. Generally, the BCMA polypeptide or nucleic acid 
RJ molecule is delivered in a pharmaceutically acceptable carrier. 

Coupled Target Identification and Assay Generation 

Directly relevant to the NF-kB signal transduction pathway is the 

20 identification of molecules that could potentially play a role in regulating 

inflammation and oncogenesis. The search for NF-kB activating molecules do 
not have to be restricted to screening cDNA expression libraries; the same 
principles can be used to identify compounds which modulate the output of the 
pathway, either by mimicking the activity of a polypeptide or by blocking its 

25 activity when the polypeptide is overexpressed. The latter compounds will act 
either upon or downstream of the overexpressed protein; once enough 
activating proteins have been identified, it will be possible to identify at what 
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step the compounds act by e\'aluating their action on a panel of transfected cells 
expressing different activating proteins. 

In general, compounds are identified from large libraries of both 
natural product and synthetic (or semi-synthetic) extracts or chemical libraries 
according to methods known in the art. Those skilled in the field of drug 
discovery and development will understand that the precise source of test 
extracts or compounds is not critical to the screening procedure(s) of the 
invention. Accordingly, virtually any number of chemical extracts or 
compounds can be screened using the methods described herein. Examples of 
such extracts or compounds include, but are not limited to, plant-, fungal-, 
prokaryotic- or animal-based extracts, fermentation broths, and synthetic 
compounds, as well as modification of existing compounds. Numerous 
methods are also available for generating random or directed synthesis (e.g., 
semi-synthesis or total synthesis) of any number of chemical compounds, 
including, but not limited to, saccharide-, lipid-, peptide-, and nucleic acid- 
based compounds. Synthetic compound libraries are commercially available 
from Brandon Associates (Merrimack, NH) and Aldrich Chemical (Milwaukee, 
WI). Alternatively, libraries of natural compounds in the form of bacterial, 
fungal, plant, and animal extracts are commercially available from a number of 
sources, including Biotics (Sussex, UK), Xenova (Slough, UK), Harbor Branch 
Oceangraphics Institute (Ft. Pierce, FL), and PharmaMar, U.S.A. (Cambridge, 
.MA). In addition, natural and synthetically produced libraries are produced, if 
desired, according to methods known in the art, e.g., by standard extraction and 
fractionation methods. Furthermore, if desired, any library or compound is 
readily modified using standard chemical, physical, or biochemical methods. 

In addition, those skilled in the art of drug discovery and 
development readily understand that methods for dereplication (e.g., taxonomic 
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dereplication, biological dereplication, and chemical dereplication. or any 
combination thereof) or the elimination of replicates or repeats of materials 
already knowTi to modulate the test promoter should be employed whenever 
possible. 

Wlien a crude extract is found to modulate reporter gene expression, 
further fractionation of the positive lead extract is necessary to isolate chemical 
constituents responsible for the observed effect. Thus, the goal of the 
extraction, fractionation, and purification process is the careful characterization 
and identification of a chemical entity within the crude extract having the 
desired activity. Methods of fractionation and purification of such 
heterogenous extracts are known in the art. If desired, compounds shown to be 
useful agents for the modulation of gene expression or biological activity are 
chemically modified according to methods known in the art. 

Primaiy screens for compounds that modulate BCMA biological activity 
Modulating NF-kB expression or biological activity modulates 
numerous important cellular processes. The finding that NF-kB activity is 
regulated by BCMA allows us to provide assays for dmgs that modulate NF-kB 
by monitoring BCMA expression or biological activity. Such assays may 
measure BCMA expression by measuring changes in: (a) levels of BCMA 
protein; (b) levels of BCMA RNA; (c) levels of BCMA-mediated NF-kB 
biological activity; or (d) levels of a reporter gene or protein expressed from a 
NF-kB promoter. These measurements may be made in vitro or in vivo. These 
assays allow for the identification of compounds that modulate NF-kB 
biological activity (e.g., gene transcription). Such identified compounds may 
have therapeutic value, for example, in the treatment of diseases that result in 
too little or too much cell death. 
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Cells overexpressing BCMA can be produced using standard 
techniques. Compounds that are identified may bind to BCMA and prevent 
BCMA activation of NF-kB. While screening of compounds can be performed 
with cultures of primary cells, cell lines be also be used. Cell lines can be 
modified such that the cells constitutively express a BCMA polypeptide, for 
example, the BCMA intracellular domain. 

Any cell line, such as ones described above, can also be engineered 
to contain a reporter gene expressed under control of the NF-kB promoter 
(described above). A preferred reporter gene codes for GFP. Typically, the 
expression of the gene (e.g., the endogenous NF-kB gene or a recombinant 
reporter gene expressed under the control of the NF-kB promoter or fragment 
thereof) is measured by assaying the RNA or protein levels or both of the 
expressed gene. For example, the polypeptide expressed by the NF-kB gene or 
by the reporter gene produces a detectable signal under conditions that allow 
compound-mediated changes to be measured. Quantitatively determining the 
amount of signal requires comparing the amount of signal produced in the 
absence of any compound being tested to the amount produced when the cell is 
contacted with the compound, as is described herein. The comparison permits 
the identification of the compound as one that causes a change in the detectable 
signal produced by the expressed gene (e.g., at the RNA or protein level) and 
thus identifies a compound that is capable of modulating NF-kB expression. In 
order to prevent the NF-kB cells from dying, a second gene encoding an 
apoptosis inhibitor can also be expressed in the cells, as described herein. 

Secondaiy screens for compounds that modulate NF-kB activity 

After test compounds that appear to modulate NF-kB expression are 
identified, it may be necessary or desirable to subject these compounds to 
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further testing. The invention provides such secondary confirmator>' assays. 
For example, a compound that appears to modulate NF-kB activity^ in early 
testing may be subject to additional assays to confirm that the compound also 
modulates NF-kB activity in vivo. In the first round of in vivo testing, NF-kB 
activity is initiated in animals by well-known methods and the compound is 
administered by one of the means described in the "Therapy" section, 
immediately below. Cells or cellular tissue are isolated within hours to days 
follov^'ing the insult, and are subjected to assays to assess the level of NF-kB 
expression or biological activity. Such assays are well known to those skilled 
in the art. Examples of such assays include, but are not limited to, ELISAs, 
Western blot analysis, RT-PCR, RIA, and Northern blot analysis. 

Therapy 

NF-kB is an important regulator of inflammatory responses (e.g., 
rheumatoid arthritis, inflammatory bowel disease, septic shock), apoptosis, 
oncogenesis, and anti-viral and anti-bacterial responses. Therefore, the 
discovery of new gene products that regulate NF-kB activity, and thus the 
disease process, will result in the identification of molecular targets for 
phamiaeological intervention. By increasing or mimicking BCMA biological 
activity, one could, for example, boost anti-tumor antibody production or 
increase T cell cytotoxicity against tumor cells in cancer immunotherapy. 
Conversely, antagonizing BCMA biological activity would be advantageous, 
for example, in situations in which it is desirable to down-regulate immune cell 
function. Compounds, identified using any of the methods disclosed herein, 
may be administered to patients or experimental animals with a 
phamiaceutically- acceptable diluent, carrier, or excipient, in unit dosage form. 
Conventional pharmaceutical practice may be employed to provide suitable 
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formulations or compositions to administer such compositions to patients or 
experimental animals. Although intravenous administration is preferred, any 
appropriate route of administration may be employed, for example, parenteral, 
subcutaneous, intramuscular, intracranial, intraorbital, ophthalmic, 
intraventricular, intracapsular, intraspinal, intracistemal, intraperitoneal, 
intranasal, aerosol, or oral administration. Therapeutic formulations may be in 
the form of liquid solutions or suspensions; for oral administration, 
formulations may be in the form of tablets or capsules; and for intranasal 
formulations, in the form of powders, nasal drops, or aerosols. 

I Methods well known in the art for making formulations are found in, 

I for example. Remington: The Scie nce and Practice of Pham-iar-y ( 1 9th ed.) ed. 

J Gennaro AR., 1 995, Mack Publishing Company, Easton, PA. Formulations for 

0 parenteral administration may, for example, contain excipients, sterile water, or 

□ saline, polyalkylene glycols such as polyethylene glycol, oils of vegetable 

5 15 origin, or hydrogenated napthalenes. Biocompatible, biodegradable lactide 
3 polymer, lactide/glycolide copolymer, or polyoxyethyl ene-polyoxypropyl ene 

copolymers may be used to control the release of the compounds. Other 
potentially useful parenteral delivery systems for antagonists or agonists of the 
invention include ethylene-vinyl acetate copolymer particles, osmotic pumps, 
20 implantable infusion systems, and liposomes. Formulations for inhalation may 
contain excipients, for example, lactose, or may be aqueous solutions 
containing, for example, polyoxyethylene-9-lauryl ether, glycocholate and 
deoxycholate, or may be oily solutions for administration in the form of nasal 
drops, or as a gel. 

As described herein, we have discovered that BCMA activates NF- 
kB activity. NF-kB, in turn, activates numerous cellular processes. Hence, any 
compound that modulates NF-kB expression is a candidate compound for use 
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in drug development. One possible compound is a polypeptide fragment of 
BCMA which maintains its ability to bind to another component of the signal 
transduction pathway but has lost its ability to activate NF-kB. Such a 
polypeptide will act as an inhibitor of wild-type NF-kB signaling . Another 
possible compound is a polypeptide fragment of BCMA which exhibits 
constitutive activation of NF-kB. 

Other Embodiments 

All publications and patent applications mentioned in this 
specification are herein incorporated by reference to the same extent as if each 
independent publication or patent application was specifically and individually 
indicated to be incorporated by reference. 



embodiments thereof, it will be understood that it is capable of further 
modifications and this application is intended to cover any variations, uses, or 
adaptations of the invention following, in general, the principles of the 
invention and including such departures from the present disclosure come 
within known or customary practice within the art to which the invention 
pertains and may be applied to the essential features hereinbefore set forth. 



While the invention has been described in connection with specific 



WTiat is claimed is: 
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1 . A method for identifying a polypeptide which increases gene 
expression from a promoter, said method comprising the steps: 

(a) contacting a library of polypeptides with a cell that expresses a 
recombinant anti-cell death gene and that comprises a reporter gene operably 
linked to said promoter, whereby expression of said reporter gene is increased 
if said library comprises a polypeptide which increases gene expression from 
said promoter; 

(b) determining whether said reporter gene expression is increased in 
said cell as a result of contact with said polypeptide libraiy; and 

(c) if said reporter gene expression is increased, identifying a 
polypeptide of said library w^hich increases said reporter gene expression. 

2. The method of claim 1, wherein step (c) comprises (i) dividing 
said library into two or more libraries with less complexity; and (ii) repeating 
steps (a) and (b) until a polypeptide which activates reporter gene expression is 
identified. 

3. A method for identifying a polypeptide which decreases gene 
expression from a promoter, said method comprising: 

(a) contacting a library of polypeptides with a cell that expresses a 
recombinant anti-cell death gene and that comprises a reporter gene operably 
linked to said promoter, whereby expression of said reporter gene is decreased 
if said libraiy comprises a polypeptide which decreases gene expression from 
said promoter; 

(b) determining whether said reporter gene expression is decreased in 
said cell as a result of contact with said polypeptide library; and 

(c) if reporter gene expression is decreased, identifying a polypeptide 
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.hich decrease, said reporter gene express,ou. 

.....e.odo..™3. — s.ep„ns^ 

3 steps(a)and(b)untilapolypepttdewh,ch 

identified. — - — 

' c ■ r^nivneotide which modulates 

5. A„e.hodfcr,de«ify-ngapo>y epu ^ 

• ■ f „inr activation domain, saia mcu 
activation of a transcnpnontacto ..^es v/itl> a cellthat expresses a 

contactins a i.brar, '^^^"^'^'''^. ^^^^^ .insisting 
,.e— ant,.ce„ dear, .ene and a c..en - P^^^^^_^^ 

„f e yeas, or bacteria, DNA ..nd.ng donta.n ed 
,«„.ac.Wationdontain,andtKa.contpnses P- 

.promoter cons.stin.ofaWiprontote-d^^^-^^^^^ 

„„d,ns domain, where.y expression of 
«,,arv comprises a polypeptide which modulates ac 

-cenasares.Uofconlact.ithsa.dpo,.e^ae.^^^^^^^^ 

(c) if reporter gene expression IS altered, 
„h,ch modulates said reporter gene expression. 

„ a of claim 5, wherem step (c, comprises (i) dividing 
, The method of ^m ^^^^ 

said library mio two or more libraries ^^^^^^^,„„ 
3.eps (a) and Cb, un.il a polypeptide which modulates 
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^ 7. The method of claim 1 , 3, or 5, wherein said contacting comprises 

^ expressing a library of DNA molecules in a cell, wherein said library of DNA 

> molecules encodes said library of polypeptides. 



/ 



The method of claim 7, wherein said cell is the same cell that 



5/8. 

/ expresses a recombinant anti-cell death gene and that comprises a reporter gene 
operably linked to said promoter. 



i 9. The method of claim 7, wherein said Ubrary of DNA molecules :s 

1 o! introduced to said cell by transfection. 



10. The method of claim 9, wherein the mean number of said DNA 
molecules introduced by transfection into said cell is at least 25. 

J 5 1 1 . The method of claim 9, wherein the mean number of said DNA 

molecules introduced by transfection into said cell is at least 100. 

12. The method of claim 9, wherein the mean number of said DNA 
molecules introduced by transfection into said cell is at least 500. 



20 
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13. The method of claim 7, wherein said DNA molecules are 
expressed from a high-efficiency expression system. 

14 The method of claim I, 3, or 5, wherein said polypeptide is 
produced by the same cell that expresses a recombmant ant.cell death gene and 
that comprises a reporter gene operably linked to said promoter. 
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15. The method of claim 1, 3, or 5, wherein said polypeptide is 
produced by a cell other than the cell that expresses a recombinant anti-cell 
death gene and that comprises a reporter gene operably linked to said promoter. 

16. The method of claim 1, 3, or 5, wherein said polypeptide is 
selected from the group consisting of an extracellular ligand, a cell surface 
receptor, and a signal transduction intermediate. 



1 7. A method for identifying a compound which modulates gene 
expression from a promoter, said method comprising: 

(a) contacting a library of compounds with a cell that expresses a 
recombinant anti-cell death gene and that comprises a reporter gene operably 
linked to said promoter, whereby expression of said reporter gene is altered if 
said library comprises a compound which modulates gene expression from said 
promoter; 

(b) determining whether said reporter gene expression in said cell is 
altered as a result of contact with said compound library; and 

(c) if said reporter gene expression is altered, identifying a 
compound from said library which modulates said reporter gene expression, 

IS. The method of claim 17, wherein step (c) comprises (i) dividing 
said library into two or more libraries with less complexity; and (ii) repeating 
steps (a) and (b) until a compound which modulates gene expression fi-om a 
promoter is identified. 

19. The method of claim 1, 3, 5, or 17, wherein said promoter is a 
heterologous promoter. 
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.0 The method of cla.m 1,3,5, or 17, wherem said promoter .s 
denved from a mammal and said library of polypeptides compnses 
polypeptides derived from a bacterium or a virus. 

. f 1 n,! 5 5 orl7 wherein said reporter gene is 
21. The method of claim 1 , 3, 5, or i / , 

GFP. 

a. The method of d.™ 3, 5, or H, wherein said an.i-ce.l death 
0 members, and crmA. 

. , ■ I 1 or 17 wherein said cell is selected 
23 The method of claim 1, 3, 5, or l /, wn 

■ fPHO CD-I Cos, 293, HeLa,BHK^orJ. ceUs^^__ 
from the group rorasnng_oj;CHO,CU l ,>-o , 

,. ,4 A method for determining whether a compound modulates 

NF-kB bioloaical activity, said method comprising the steps of: 

a) providing a cell expressing a BCMA polypeptide; 

b) contacting said cell with a candidate compound; and 

c) measuring the level of expression of said BCMA polypeptide 

,0 said ce„, wherein a change m the level of expression of said BCMA 

■ .cell relative ,0 a cell not contacted with said candidate 
polypeptide m said cell, relative modulates 
compound, identifies said candidate compound as a compound 

NF-kB biological activity. 
^f^" ' method for detennining whether a compound modulates 

BCMA biological activity, said method comprising the steps of: 
a1 providing a BCMA polypeptide; 
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b, ccnu,«ing said polypeptide wi* a cand,date compound; and 

c) measuring *e ,eve, of biological aCivity of said BCMA 

■ „„i„ the level of biological activity of said BCMA 

polypeptide, wherein a change in the icvei » 

BCMA biological activity. 

. ^ ■ -y; wherein said BCMA polypeptide is in a 
26. The method of claim 25, wherem saia 

cell. 

^ . , • 0^ wherein said BCMA polypeptide is in a 

27. The method of claim 25, wherem saiu 

cell-free system. 

9^ wherein said BCMA biological activity 
28. The method of claim 25, wherem saia 

,5 is the modulation of NF-kB biological activity. 

. The method of claim 2S, wherem said NF-.B biological activity 
is the modulation of cell death. 

-1^ whprein said BCMA polypeptide 
30 The method ofclaim 25, wherem saia 

_ses a polypeptide science having snbstantia, ide tit 

T.^vAA rQPomNO- 1) or ammo acids 97-103 om 
98-164 of human BCMA (SEQ ID MU. i; 

• BCMA (SEQ ID NO: 2). 

' 31. A method for determining whether a compound modulates 
NF-kB activit>-, said method comprising the steps of: 
a) providing a BCMA polypeptide; 
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b) contacting said polypeptide with a candidate compound; and 

c) detecting the binding of said candidate compound to said 
polypeptide, wherein a candidate compound that binds to said polypeptide is i 
compound that modulates NF-kB biological activity. 

32. The method of claim 31, wherein said BCMA polypeptide 
comprises a polypeptide sequence having substantial identity to amino acids 
98-164 of human BCMA (SEQ ID NO: 1) or amino acids 97-163 of mouse 
BCMA (SEQ ID NO: 2). 

53— ^ " 

yZ. A substantially purified polypeptide comprising a polypeptide 

sequence having substantial identity to amino acids 98-164 of human BCMA 
(SEQ ID NO: 1) and not having amino acids 1-54 of human BCMA (SEQ ID 
NO: 1). 

3M 

2S: The polypeptide of claim 32, wherein said polypeptide 
modulates NF-kB activity. 

35- 

34: The polypeptide of claim 32, wherein said polypeptide consists 
of amino acids 98-164 of human BCMA (SEQ ID NO: 1). 

2r6. A substantially purified polypeptide comprising a polypeptide 
sequence having substantial identity to amino acids 97-163 of mouse BCMA 
(SEQ ID NO: 2) and not having amino acids 1-49 of mouse BCMA (SEQ ID 
NO: 2). 

31 

The polypeptide of claim 35, wherein said polypeptide 
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niodulatesNF-kB activity. 

^T.epo.ypept.eofc...35,..e.e.sa.po.ypept.dec^^^^^^^ 

o.a..noa.ds97-.63_ofn^eBj:M^^^^^^ 

' ^'iZ^^ —or — ng a polypepUde of ... 32 o. 

35 covalently linked to a heterologous compound. 

HO , f .laun 38 wherein said modulator modulates 

39. The modulator of claim 3», wn ^ 

10 NF-kB activity. 
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-"XrT • MP VR ictivitY in a cell, compnsing 

^ tf A method for activating NF-kB activity 

u- tRCMA polypeptide havmg NF-kB 
contacting said cell with a recombinant BCMA po yp ^ ^ 

activating activity. — 

"~ • KTF VR activity in a cell, comprising 

^ A method for activating NF-kB activity 

BCMA polypeptide havi„gNF-kB ac.W_a«ng, acuv,,,, 

' „ f, BCMA polypeptWe for F=P»™S''P''^™'"""'" 

compos«,on for treating cancer, apop.os.s. 

response. 

n fa BCMA nucleic acid molecule for prepanng a 
.43r. Use ofa BCMA nut. , viral infection, or 

Von for treating cancer, apoptosis, a Mral mi 
, pharmaceutical composiuon for treating 

an inflammatory response. 
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MLQr4AGQCSQ NEYFDSLLHA 
GLSLIISLAV FVLMFLLRKI 
YTVEECTCED CIKSKPKVDS 
ISAR (SEQ ID NO: 1) 



CIPCQLRCSS NTPPLTCQRY 
SSEPLKDEFK . NTGSGLLGMA 
DHCFPLPAME EGATILVTTK 



Fig. 7A 
Human BCNA 



CNASVTNSVK GTNAILWTCL 
NIDLEKSRTG DEIILFRGLE 
TNDYCKSLPA ALSATEIEKS 



MAQQCFHSEY FDSLLHACKP 
LSLALFTISF LLRKMNPEAL 
TVEECTCEDC VKSKPKGDSD 
PTHTR (SEQ ID NO: 2) 



CHLRCSNPPA TCQPYCDPSV 
KDEPQSPGQL DGSAQLDKAD 
KFFPLPAMEE GATILVTTKT 



TSSVKGTYTV LWIFLGLTLV 
TELTRIRAGD DRIFPRSLEY 
GDYGKSSVPT ALQSVMGMEK 



Fig. 7B 
Mouse BCNA 
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1 . With regard to the elements of the international application: * 
I X I the international application as originally filed 
the description: 
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NONE 



, as originally filed 

filed with the demand 



NONE 



. filed with the letter of 



I x| claims: 

pages 

pages 

pages 

pages 



30-37 



NONE 



NONE 



NONE 



. as originally filed 

, as amended (together with any statement) under Article 19 
. filed with the demand 



, filed with the letter of 



I x| drawings: 

pages ]^ 

pages 

pages 



NONE 



NONE 



. as originally filed 

, filed with the demand 



I X| the sequence listing part of the description: 

pages NONE 

pages . 

pages 



, filed with the letter of 



NONE 



NONE 



, as originally filed 

, filed with the demand 



, filed with the letter of . 



2. With regard to the language, all the elements marked above were available or furnished to this Authority in the language in which 
the international application was filed, unless otherwise indicated under this item. 

These elements were available or furnished to this Authority in the following language v\iuch is: 

I I the language of a translation furnished for the purposes of international search (imder Rule 23.1(b)). 
I I the language of publication of the international application (under Rule 48.3(b)). 

I I the language of the translation furnished for the pmposes of intemational preliminaiy examination (under Rules 55^ and/ 

or 55.3). 

3. With regard to any nucleotide and/or amino acid sequence disclosed in the intemational application, the intemational 
preliminaiy examination was carried out on the basis of the sequence listing' 

□ contained in the intemational application in printed form. 

I I filed together with the intemational application in computer readable form. 

I I furnished subsequently to this Authority in written form. 

I I furnished subsequently to this Authority in computer readable fomi. 

□ The statement that the subsequently furnished written sequence listing does not go beyond the disclosure in the 
mtemational application as filed has been furnished. 

The statement that the information recorded in conputer readable form is identical to the writen sequence listing has 
* — ' been furnished. 

4 nn The amendments have resulted in the cancellation of: 

the description, pages NONE 

the claims, Nos. NONE 

the drawings, sheets/fig NONE 



5- I I This report has been drawn as if (some of) the amendments had not been made, since they have been considered to go 
beyond the disclosure as filed, as indicated in the Supplemental Box (Rule 70.2(c)).** 

» Replacemenx sheets which have been flmushed to the receiving Office in response to an invitation under Article 14 are referred to 
in this report as "originally filed" and are not annexed to this report since they do not contain amendments (Rides 70.16 
and 70.17), ' 

*Mnv re placement sheet containing such amendments must be referred to under item 1 and annexed to this retx>rt. 
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III. Non-establishment of opinion with regard to novelty, inventive step and industrial applicability 



1 The questions whether the claimed invention appears to be novel, to involve an inventive step (to be non obvious), or to be 
industrially applicable have not been and will not be examined in respect of: 

I I the entire international application. 

fx] claims Nos. 5-6. 17-18. 24-43 

because: 

I I ^® said international application, or the said claim Nos, _ relate to the following subject matter which 
— ' does not require international preliminary examination (specify). 



I I the description, claims or drawings (indicate particular elements below) or said claims Nos. _ are so 
unclear that no meaningful opinion could be formed (specify). 



|~| the claims, or said claims Nos. _ are so inadequately supported by the description that no meaningful 
opinion could be formed. 



HI 



no international search report has been established for said claims Nos. 5-6. 17-18. 24-43 



2. A meaningful international preliminaiy examination cannot be carried out due to the failure of ttie nucleotide and/or amino acid 
sequence listing to comply with the standard provided for in Annex C of the Administrative Instructions: 

□ 

the written form has not been furnished or does not comply with the standard 
□ the computer readable form has not been furnished or does not comply with the standard. 
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V. Reasoned statemenl under Article 35(2) with regard to novelty, invenHve step or industrial applicability; 
citations and explanations supporting such statement 



1. statement 

Novelty (N) Claims 1-4. 7-16. 19-23 YES 

Claims NONE jsjO 

Inventive Step (IS) Claims NONE YES 

Claims 1-4. 7-16. 19-23 NO 

Industrial Applicability (lA) Claims 1-4, 7-16, 19-23 YES 

Claims NONE jsfQ 



2. citations and explanations (Rule 70.7) 

Claims 1-4, 7-16, 19-23 lack an inveniive step under PCT Article 33(3) as being obvious over either Bimbaum (Journal of 
Virology, April 1994, Vol 68, No. 4, pages 2521-2528) or Clem et al (Science, vol. 254, November 1991. pages 13881390)in 
view of Zolotukin et al (US Patent 5,874,304). 

Bimbaum et al leach an apoptosis -inhibiting gene from s nuclear polyhedrolsis vims eojoding a polypeptide with Cys/His 
sequence motifs. The reference teaches that two different baculoviras genes are known to be able to block apoptosis triggered 
upon infection of Spodoptera ceUs with p35 mutants of the insea baculovinis. the refercoje teadies thai using a gei^tic 
con^)lementation assay to identify additional genes which inhibit apoptosis during infection with p35 mutant, Bimbaum et al have 
isolated a gene OpNPV tiiat was able to functionally substitute for AcMNPV p35. The reference teaches a~ genetic 
complementation assay to identify apoptosis blockin genes in which SF-21 cells were cotransfeoed with vAcAnh and the test 
baculovirus DNA and transfected cells monitored. 

Clem et al teach prevention of Apoptosis by a Baculovirus Gene during Infection of Insea cells. Clam et al have identif ied 
annihilator (vAcAnh) viral mutant was identified from expression vectors. The mutani caused premature death of Spodoptera 
frugiperda cells. 

Either Bimbaum et al or Clem et al have taught the method of identification of the gene or the polypeptide involved 
in apoptosis using the a reporter gene expression However, Zolotukhin et al teach humanized green fluorescent protein genes 
and methods of use of tiie protein in several assays. The reference teaches that the ejqpression veaors may comprise a multiple 
cloning sites that is operatively positioned downstream from gfp gene sequence, and these vectors are useful in addition to the 
uses in creating C terminal fusion proteins by cloning a second protein encoding DNA segment into tiie multiple cloning site 
so that it is contiguous and in frame with the gfp sequence. The reference teaches that die recombinant host cells will e:q>iess 
GFP to produce encoded GFP protein, preferably in amount sufficient to allow GFP detection by its fluorescence. The refereiKC 
leaches tiiat the expression vectors comprising a GFP (Continued on Supplemental Sheet.) 
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gene operatively linked to a seleaed gene, and the fusion protein being produced in amount sufficient to allow cell detection 
by detecting the green fluorescence of GFP. Thus, it would have been obvious to use the GFP protein as a reporter gene in the 
method of identifying the gene involved in the apoptosis. 

Claims 1-4, 7-16. 19-23 meet the criteria set out in PCT Article 33(2) and (4). because the prior art does not teach the method 
of identifying the a polypeptide which increases or decreases gene expression from a promoter using a GFP r^orter gene 
operably linked to said promoter. 
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METHOD FOR CLONING SIGN AT. 
TR.\NSDUCTION INTERMEDIATFS 

Statement as to Federally Sponsored Research 
This research has been sponsored in part by grant number AI27849 
from the National Institutes of Health. The U.S. government has certain rights 
to this invention. 

Background of the Invention 

The invention relates to methods of identifying polypeptides and 
compounds which regulate gene expression. 

Pharmaceuticals have historically been developed by testing libraries 
of up to several thousand compounds in laboratory animals, usually one 
compound at a time. The slow pace of the process, and its unsuitability for 
screening large numbers of diverse compounds, led to the development of 
approaches based on assays that can be completed quickly and ex vivo. With 
these approaches, the pharmaceutical drug discovery process has evolved into a 
catenation of several, sometimes partially concurrent, phases. 

In the first phase, target identification and validation, a candidate 
target for a potential drug is identified by various means. These means may 
include hypotheses formed from the study of the pathophysiology' of disease in 
humans or experimental animals, analysis of candidate signal transduction 
pathways in vitro, natural experiments such as genetic disorders of humans or 
other animals, results from targeted or random gene disruptions in model 
organisms, or disclosures by competitors. The projected consequences of a 
hypothetical drug interacting with its intended target to stimulate, block or 
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modulate the target activity is then tested. If the outcome of testing suggests 
that a drug affecting the target would have the desired physiological 
consequences in an intact organism, the target is said to be validated. Neither 
target identification nor validation are formally required, and, indeed, the 
5 historical process had neither of these phases. 

Present models for pharmaceutical industry productivit>' emphasize 
the importance of efficiently identifying large numbers of validated targets and 
of developing rapid screens for their activit>\ The impetus for this model is the 
widespread belief that targets cannot be predicted effectively, so that a large 

10 number of targets must be evaluated to develop a small number of drugs. 

Hence, there remains an important need across the industry to accelerate the 
pace of target discovery and validation. 

Once the target has been identified and validated, an assay must be 
created to allow large libraries of synthesized or natural compounds to be tested 

15 for their ability to interact with the target. WTien a compound is identified 

which acts on the target specifically, it is usually said to be a hit. The definition 
of a hit generally encompasses compounds that pass various secondary tests to 
assure that their activit\' is specific to the target of interest. From the various 
hits that are generated, one or more structures are chosen to represent the 

20 starting point for a program of systematic modification of the chemical 

structure. These structures are called lead compounds, and they are frequently 
selected from hits on the basis of their compatibilit>' with directed synthesis 
programs, expected toxicity, or expected absorption, distribution, metabolism 
or excretion characteristics. Again, the process encompasses a wide range of 

25 practices and although they are usually distinguished, hits and lead compounds 
are sometimes referred to in ways that make them appear equi\'alent. 
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Summarv of the Inventinn 
In general, the invention provides a novel, rapid method for 
identifying members of selected signal transduction pathways which are targets 
for drug design. 

5 In a first aspect, the invention features a method for identifying a 

polypeptide which increases gene expression from a promoter, including (a) 
contacting a library of pol>T3eptides with a cell that expresses a recombinant 
anti-cell death gene and that contains a reporter gene operably linked to a 
promoter, whereby expression of the reporter gene is increased if the library 

10 includes a pol>TDeptide which increases gene expression from the promoter; (b) 
determining w^hether the reporter gene expression is increased in the cell as a 
result of contact with the polypeptide librar>% and (c) if reporter gene 
expression is increased, identifying a polypeptide of the library which increases 
reporter gene expression. 

15 In a second aspect, the invention features a method for identifying a 

polypeptide which decreases gene expression from a promoter, including (a) 
contacting a library of polypeptides with a cell that expresses a recombinant 
anti-cell death gene and that contains a reponer gene operably linked to the 
promoter, whereby expression of the reporter gene is decreased if the library 

20 includes a polypeptide which decreases gene expression from the promoter; 

(b) determining whether reporter gene expression is decreased in the 
cell as a result of contact with the poljT^eptide library; and 

(c) if reporter gene expression is decreased, identifying a pol>peptide 
which decreases reporter gene expression. 

25 In a third aspect, the invention feamres a method for identifying a 

poh^peptide which modulates activation of a transcription factor activation 
domain, including (a) contacting a library of polypeptides with a cell that 
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expresses a recombinant anti-cell death gene and a chimeric transcription factor 
consisting of a yeast or bacterial DNA binding domain fused to a mammalian 
transcription factor activation domain, and that contains a reporter sene 
operably linked to a promoter consisting of a basal promoter and binding sites 
5 for the DNA binding domain, whereby expression of the reporter gene is 

altered if the library comprises a polypeptide which modulates acti\'ation of the 
transcription factor activation domain; (b) determining whether reporter gene 
expression is altered in the cell as a result of contact with the polypeptide 
library; and (c) if reponer gene expression is altered, identifying a polypeptide 

10 which modulates reporter gene expression. 

In a fourth aspect, the invention features a method for identifying a 
compound which modulates gene expression from a promoter, including (a) 
contacting a library of compounds with a cell that expresses a recombinant anti- 
cell death gene and that contains a reporter gene operably linked to the 

15 promoter, whereby expression of the reponer gene is altered if the library 
includes a compound which modulates gene expression fi-om the promoter; 

(b) determining whether reporter gene expression in the cell is 
altered as a result of contact with the compound library; and 

(c) if reporter gene expression is altered, identifying a compound 
20 from the library which modulates reporter gene expression. 

In a fifth aspect, the invention features a method for identifying a 
compound which decreases gene expression, including (a) contacting a library 
of compounds with a cell expressing (i) a recombinant anti-cell death gene; (ii) 
a second gene encoding a polypeptide; and (iii) a reporter gene that would have 
25 decreased expression if the function of the polypeptide was blocked; (b) 

determining whether expression of the reporter gene is decreased as a result of 
contact with the compound library; and (c) if expression of the reporter gene is 



wo 00/50633 



PCT/USOO/04925 



-5- 

decreased, identifying a compound from the librar\' which decreases reporter 

gene expression. 

In preferred embodiments of the first, second, and third aspects, a 

library of DNA molecules encoding the library of polypeptides are expressed in 
5 a cell. The cell can be the same cell that expresses a recombinant anti-cell 

death gene and that contains a reporter gene operably linked to a promoter (in 

which case the polypeptide is produced by the same cell that expresses a 

recombinant anti-cell death gene and that contains a reporter gene operably 

linked to a promoter), or the polypeptide can be produced by a cell other than 
10 the cell that expresses a recombinant anti-cell death gene and that includes a 

reporter gene operably linked to a promoter; preferably the DNA molecules are 

expressed from a high-efficiency expression system. 

In other preferred embodiments of the first, second, and third aspect, 

the library of DNA molecules is introduced to the cell by transfection, and the 
15 mean number of DNA molecules introduced by transfection to the cell is at 

least 25. Preferably, the mean number of DNA molecules introduced by 

transfection into the cell is at least 100 or even 500. 

In still other preferred embodiments of the first, second, and third 

aspect, the polypeptide is selected from the group consisting of an extracellular 
20 ligand, a cell surface receptor, and a signal transduction intermediate, and the 

DNA molecules are expressed from a high-efficiency expression system. 

In preferred embodiments of the first, second, third, fourth, and fifth 

aspects, step (c) includes (i) dividing the library into two or more libraries with 

less complexity; and (ii) repeating steps (a) and (b) until a polypeptide which 
25 activates reporter gene expression is identified; the promoter can derived fi-om 

a mammal and the library of polypeptides can include pol>peptides derived 

from a bacterium or a virus. 
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In other preferred embodiments of the first, second, third, fourth, and 
fifth aspects, the promoter is a heterologous promoter, the reporter gene is GFP, 
the anti-cell death gene is selected from the group consisting of bcl family 
members, lAP family members, and crmA, and the cell is selected from the 

5 group consisting of CHO, CD-I , Cos, 293, HeLa, BHK, or L cells. 

In another aspect, the invention feamres a method for determining 
whether a compound modulates NF-kB biological activity, the method 
including the steps of; (a) providing a cell expressing a BCMA polypeptide; (b) 
contacting the cell with a candidate compound; and (c) measuring the level of 

10 expression of the BCMA polypeptide in the cell, wherein a change in the level 
of expression of the BCMA polypeptide in the cell, relative to a cell not 
contacted with the candidate compound, identifies the candidate compound as a 
compound that modulates NF-kB activity. 

In still another aspect, the invention features a method for 

15 determining whether a compound modulates BCMA biological activity, the 
method including the steps of: (a) providing a BCMA polypeptide; (b) 
contacting the polypeptide wnth a candidate compound; and c) measuring the 
level of biological activity of the BCMA polypeptide, wherein a change in the 
level of biological activity of the BCMA polypeptide, relative to a polypeptide 

20 not contacted with the candidate compound, identifies the candidate compound 
as a compound that modulates BCMA activity. The BCMA polypeptide can be 
in a cell or in a cell-free system. A preferred BCMA biological activity is the 
modulation of NF-kB biological activity (e.g., the modulation of transcription 
by NF-kB) or NF-kB expression. Preferably, the BCMA polj'peptide includes 

25 a polypeptide sequence having substantial identity to amino acids 98 to 1 64 of 
human BCMA (SEQ ID NO: 1) or amino acids 97-163 of mouse BCMA (SEQ 
ID NO: 2). 
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In yet another aspect, the invention features a method for 
determining whether a compound modulates NF-kB activity, the method 
including the steps of: (a) providing a BCMA polypeptide; (b) contacting the 
polypeptide with a candidate compound; and (c) detecting the binding of the 
5 candidate compound to the polypeptide, wherein a candidate compound that 
binds to the polypeptide is a compound that modulates NF-kB biological 
activity. The BCMA pol>T)eptide can be in a cell or in a cell-free system. 
Preferably, the BCMA polypeptide includes a polypeptide sequence having 
substantial identity to amino acids 98 to 164 of human BCMA (SEQ ID NO: 1) 

10 or amino acids 97-163 of mouse BCMA (SEQ ID NO: 2). 

In another aspect, the invention features a substantially purified 
poljT^eptide consisting of a BCMA polypeptide molecule lacking a BCMA 
extracellular domain. Preferably, the BCMA polypeptide includes a 
polypeptide sequence having substantial identity to amino acids 98 to 164 of 

15 human BCMA (SEQ ID NO: 1) or amino acids 97-163 of mouse BCMA (SEQ 
ID NO: 2). Other preferred polypeptides are BCMA pol>^eptides that 
modulate NF-kB activity. 

In yet another aspect, the invention features an NF-kB modulator 
including a BCMA polypeptide covalently linked to a heterologous compound. 

20 Preferably, the BCMA polypeptide includes a polypeptide sequence having 

substantial identity to amino acids 98 to 164 of human BCMA (SEQ ID NO: 1) 
or amino acids 97-163 of mouse BCMA (SEQ ID NO: 2). Preferably, the 
modulator modulates NT-kB biological activity or expression. 

The invention also features methods for activating NF-kB activit>' in 

25 a cell by contacting the cell with a recombinant BCMA polypeptide having NF- 
kB activating activity or by contacting the cell with a recombinant nucleic acid 
molecule encoding a BCMA polypeptide having NF-kB activating activity. 
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In related aspects, the invention features the use of BCMA 
polypeptides and nucleic acids for preparing pharmaceutical compositions for 
treating cancer or apoptosis. 

By a "reporter gene'' is meant a DNA or RNA sequence which 
5 encodes a reporter protein that is capable of being readily detected either inside 
or outside a cell. 

The reporter gene is operably linked to a promoter which shows low 
spontaneous activation, so that the activity of the reporter protein in the 
presence of the activating polypeptide be at least two standard deviations 

10 greater than the activity in its absence. Various methods of increasing the 
sensitivity of reporter genes are known in the art, including: deletion from 
natural genes of inhibitory sequences, which may be found both within and 
external to the transcribed portion and which may affect mRNA formation, 
stability, or translational efficacy; addition of efficiently utilized introns to 

15 increase the rate of formation of mature mRNA; multimerization of upstream 
acti\'ation regions or binding sites for known transcriptional activators; 
increasing the copy number of the reporter gene; and protection of the activir>' 
of the reporter gene from adventitious stimulatory or inhibitory activity in 
flanking DNA by insening the reporter gene between matrix attachment 

20 regions or chromatin insulator sequences. 

Many different, types of reporter proteins are known in the art. They 
frequently comprise proteins not normally found, or present in minor amounts, 
in some cells; they include enzymes that detoxify antimicrobial agents, such as 
aminoglycoside or aminocyclitol phosphotransferases or acetyltransferases, 

25 beta-lactamases or chloramphenicol acetyltransferase; enzymes of diverse 

origin that catalyze chromogenic, fluorogenic, or chemiluminescent reactions in 
the presence of exogenous substrates, such as beta-galactosidase, beta- 
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glucuronidase, alkaline phosphatase, catechol 2,3-dioxygenase, or various 
peroxidases; enzymes that catalyze photoreactions, such as bacterial or firefly 
luciferases; enz>^mes, like glycosyl transferases, that generate nonproteinaceous 
structures easily detected by antibodies, lectins, or cognate binding proteins; 
5 proteins easily monitored upon cell surface expression or secretion such as 

surface or secreted antigens for which corresponding antibodies or recognition 
proteins are known; and proteins which catalyze the synthesis of or 
stoichiometrically embody, fluorescent structures, without exogenous 
substrates, such as the jellyfish fluorescent proteins (e.g., GFP). 

10 By "operably linked" is meant that the gene for the reporter protein is 

positioned adjacent to a promoter which directs transcription of the gene and, 
ultimately, facilitates expression of the reporter protein. 

By "promoter" is meant any minimal nucleic acid sequence 
sufficient to direct transcription of the reporter gene. The promoter is one 

15 which is activated by binding to a polypeptide. Examples of promoters useful 
in the invention are promoters which are normally linked to genes which are 
expressed when a cell is in a pathologic or disease state. (e.g., cancer, 
inflammation, or due to bacterial or viral infection), and the protein products of 
which are directly or indirectly responsible for this state. Suitable promoters 

20 include, but are not limited to, the NF-kB promoter, the interleukin-2 promoter, 
and the HIV-1 long terminal repeat promoter. 

A library of DNA molecules refers to a set of DNA molecules, each 
in a DNA expression vector. Preferably, the DNA expression vector displa\ s 
high efficiency such that the level of expression is high. While in most cases, 

25 the library includes DNA molecules encoding tens, hundreds, or even 

thousands of different polypeptides, DNA molecules in a library can also 
encode only one polypeptide (for example, during the final steps of a sib 
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seiection). A library with hundreds of different DNA molecules is considered 
to have greater "complexity" than a library with five different DNA molecules. 
Like DNA libraries, compound libraries can have different degrees of 
complexity. One feature of the invention is a method which allows for 
5 following an activity of interest through the sequential screening of libraries 
with less and less complexity. 

By "candidate compound" is meant a chemical, be it naturally- 
occurring or artificial, that is surveyed for its ability to modulate BCMA or NF- 
kB biological activity. Candidate compounds may include, for example, 

10 peptides, polypeptides, antibodies (and fi-agments thereof), synthesized organic 
molecules, naturally occurring organic molecules, nucleic acid molecules, and 
components or derivatives thereof. The candidate compounds may be screened 
using any of the methods described herein using an animal, a cell, a lysate or 
extract derived fi-om a cell, or a molecule derived from a cell. The measuring 

15 may be, for example, for the purpose of detecting altered gene expression, 
altered RNA stability, altered protein stability', altered protein levels, altered 
protein phosphorylation, or altered protein biological activity. TTie means for 
measuring may include, for example, antibody labeling, immunoprecipitation, 
phosphorylation assays, and methods known to those skilled in the art for 

20 detecting nucleic acids. 

By "modulating" is meant changing, either by decrease or increase. 
By "BCMA polypeptide" is meant a polypeptide that has substantial 
identity to human or mouse BCMA as shown in Figs. 7A and 7B, respectively, 
over a region of t\vent>^ consecutive amino acids and has a BCMA biological 

25 activity. 

By "substantial identity" is meant that two polypeptide sequences, 
when optimally aligned, such as by the GAP or BESTFIT programs using 
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default gap weights, share at least 80% sequence identity, more preferably at 
least 90% sequence identity, and most preferably at least 95% or even 99% 
sequence identity or more. 

By "heterologous compound" is meant a polypeptide, chemical, or 
detectable label (e.g., biotin) that is not naturally associated, in this case, with 
the polypeptide. 

By '^BCMA biological activity" is meant BCMA-mediated 
modulation of NF-kB biological activity, or expression or the binding of an 
antibody that specifically binds a BCMA polj^^eptide. 

By "NF-kB biological activity" is meant any function performed by 
activated NF-kB, 

The invention features a method of screening DNA and compound 
libraries for their ability to modulate reporter gene expression in a cell which is 
expressing a recombinant anti-cell death gene. The expression of this gene 
allows the cell to survive in conditions which would otherwise lead to its death 
and, as a result, failure of the screening procedure. 

This method is suitable for identifying polypeptides that modulate 
transcription from a selected promoter. These polypeptides are, in turn, targets 
for dmgs. The method is also suitable for identifying compounds which either 
mimic or block function of a polypeptide which itself modulates transcription 
from a selected promoter. 

Hence, the invention features a new, efficient multistep method for 
identifying lead compounds which modulate expression from a promoter for 
development of pharmaceutical compounds. 

The invention also features a ne\\ NF-kB activator, BCIVIA. As NF- 
kB is involved in numerous cellular processes and disease states, BCMA is 
useful for the treatment of disease. BCMA is also useful for the identification 
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of compounds that modulate its expression or biological activit\ . These 
compounds may be developed as drugs, or used as lead compounds for the 
purpose of identifying drugs. 

Other features and advantages of the invention will be apparent from 
the following detailed description, the drawings, and the claims. 

Brief Description of the Drawings 

Fig. 1 is a schematic illustration of a reporter cell system for rapid 
identification of cDNAs encoding polypeptides of interest. A cDNA library is 
transfected into a cell which includes a reporter gene (e.g., GFP or Thy-1) 
operably linked to a promoter (in this case the NF-kB promoter). If the 
polypeptide expressed by one of the plasmids in the reporter cell is capable of 
inducing expression from the NF-kB promoter, the reporter gene is expressed, 
and the detectable marker protein is produced. 

Fig. 2 is a schematic illustration of a cDNA library being divided 
into smaller libraries. Each of the libraries is then transfected into reporter 
cells. Libraries which induce reporter gene expression are then further divided 
until the cDNA(s) encoding the polypeptide which induces reporter gene 
expression are isolated. 

Fig. 3 is a schematic illustration of three stages of enrichment for an 
intracellular signal transduction intermediate. The ratios represent the 
occurrence of the cDNA encoding the reporter gene-inducing pol^q^eptide 
compared to the total cDNAs. Note that, during transfection, each cell receives 
more than one plasmid. 

Fig. 4 is a schematic illustration showing how secreted factor can be 
distinguished from intracellular signal transduction intermediates such as 
receptors or kinases. Soluble ligands diffiise through the medium to engage 
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receptors on uniransfected cells, which can lead to reporter gene expression in 
most, if not all, cells. Intracellular effectors are restricted to the transfected 
cells. 

Fig. 5 is a schematic illustration of arrayed clones. N- elements, 
pooled into N columns and N rows, can be assayed with 2N transfections. In 
the example provided, 4 rows and 4 columns were positive. 4- or 16 
transfections are then required to identify the intersections unambiguously. 

Fig, 6 is a schematic illustration of an indexed library protocol. The 
method is similar to the one described in Fig. 1 to Fig. 5, except that individual 
plasmids are grown as individual cultures. Libraries of plasmids are then 
prepared and assayed. This method facilitates the rapid recovery of individual 
plasmids, and prevents the loss of positives when a library is divided into 
libraries of low^er complexity. 

Fig. 7A is a schematic illustration showing human BCNA 
polypeptide sequence (GenBank accession number Q02223). The putative 
transmembrane domain is indicated in bold. 

Fig. 73 is a schematic illustration showing mouse BCNA 
polypeptide sequence (GenBank accession number AAC23799). 

Fig. 8 is a schematic illustration showing that the intracellular 
domain of human BCMA was capable of activating NF-kB. A series of fusion 
proteins were constructed using the CD5L leader sequence, IgG Fc, a CD7 
transmembrane domain, and amino acids from human BCMA (SEQ ID NO: 1) 
as follows: Ig7bcma"amino acids 78-184; AC20"amino acids 78-164; AC40-- 
amino acids 78-144; AC60~amino acids 78-124; AC80-amino acids 78-104: AN20- 
-amino acids 98-1 84; AN40-amino acids 1 1 8-184; AN60-amino acids 138-1 84; 
AN80--amino acids 158-184. 
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Detailed Descriptiof^ 
The present invention provides a method for the rapid identification 
of molecules involved in regulation of specific signal transduction pathways. It 
can be used both as a target identification tool and as a rapid assay for drug 
5 effectiveness, and as such can significantly reduce the amount of time needed 
to go from target to hit. 

The method features an expression cloning approach that identifies 
polypeptides that have the ability to activate reporter genes (Fig. 1). Such 
polypeptides are called here activating polypeptides. Expression cloning is a 
10 technique which identifies polypeptides solely on the basis of their ability to 
generate an observable activity of interest. In order for that activity to be 
identified, the polypeptide must generally be expressed, either in vivo or in 
vitro, and a suitably sensitive assay must be available to detect the activity after 
the polypeptide has been expressed. In the present method, it is preferred that 
15 the activity that is detected is dependent on the output of a reporter gene (Fig. 
1). 

Although the use of expression cloning in conjunction with reporter 
genes is not new, existing methods of application have not resulted in highly 
efficient systems for the identification of large numbers of new molecules. The 

20 present approach is a high throughput system for the identification of cDNA 

clones encoding poljq^eptides that induce reporter gene activity. Some features 
of the high throughput system of the present invention are: (1 ) a very high 
efficiency cDNA expression plasmid; (2) an easily detected reporter molecule; 
(3) features to prevent the death of the reponer cell due to toxicity of expressed 

25 genes; and (4) a method for the introduction of large numbers of plasmids into 
multiple cells. 

Expression cloning can be carried out by either of two general 
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paradigms, positive selection, or sib selection, also known as pool division. By 
positive selection is meant a method for the enrichment of cells, \'iruses or 
genetically linked assemblies of proteins and the nucleic acids that encode them 
by physically separating the cells or genetically linked assemblies from the 
5 much larger collection of cells or assemblies that encode molecules of little 

interest. Because the nucleic acids that encode the polypeptides of interest are 
physically linked to the cells, viruses, or other assemblies that they comprise, 
the nucleic acid is physically purified by positive selection and the process can 
be repeated until a single nucleic acid is found that encodes the activity of 
10 interest. It is desirable in positive selection schemes to develop methods for 

association of a single nucleic acid with a single protein assembly or cell. Thus 
in positive selection it is desirable to identify and use methods for the 
introduction of nucleic acids into cells that result in a single nucleic acid 
species per cell. 

15 The method of the invention uses sib selection, a process of 

screening in which pools of molecules that are derived from cognate libraries of 
nucleic acids are assayed for activity, and positive libraries are detected by 
some signal (e.g., expression of a reporter gene). The nucleic acids that make 
up the library are then separated into libraries with less complexity, which are 

20 then reassayed and redivided until a single nucleic acid is found that encodes 
the polypeptide with the desired activity (Figs. 2, 3, and 4) Because sib 
selection depends on the detection of the activity of aggregates of nucleic acids, 
it is often advantageous to use methods for the transfection or introduction of 
nucleic acids into cells that result in a large number of nucleic acid species per 

25 cell. Sib selection schemes, like positive selection methods, can also be carried 
out entirely in vitro. 
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Features of the Invention 
Transfection Method 

For sib selection to be an efficient method of cloning genes, 
according to the invention, it is important that a large number of DNA molecule 
5 are introduced into each cell. Many methods of introducing DNA molecules 
are knov^oi in the art, including microinjection, complexation with positively 
charged synthetic polycations such as DEAE dextran, polybrene, polylysine, or 
polyarginine, complexation with histones and other basic proteins, 
complexation with cationic lipids or related amphipathic molecules, 

10 condensation with polyethylene glycol or polyhydroxybutyrate, coprecipitation 
with calcium phosphate, electroporation, scrape loading or partial rupture, and 
fusion with bacterial or microbial spheroplasts. Preferred among these are 
methods that can be easily carried out in parallel and that result in the co- 
introduction into the reporter cell of multiple nucleic acids that encode different 

15 species. Moreover, the method of transfection will preferably provide, on 
average, at least 25 DNA molecules per cell. More preferably, the mean 
number will be at least 100 or even 500 DNA molecules per cell. The methods 
which achieve the preferred results include calcium phosphate coprecipitation, 
complexation with polycations or cationic lipids, and condensation with 

20 uncharged polymers such as polyethylene glycol. 

The libraries of nucleic acid molecules encoding potentially 
activating polypeptides can also be created in a biologically active assembly, 
such as a virus or viral transducing particle, which is capable of introducing 
itself into the reporter cell directly. In such a case a cDNA library is prepared 

25 in the viral vector, and libraries of active virus or transducing particles are 
applied to the reporter cell. In general the reporter cell will have been 
previously engineered to contain a reporter gene, but the reporter gene may also 
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be introduced concurrently with the activating nucleic acid. 

High Efficiency- cDNA Expression Vector 

To detect ectopic activation of a signal transduction pathway it is 
5 desirable to greatly overexpress proteins that act as signal transduction 
intermediates. 

The invention uses a high efficiency cDNA expression system to 
produce proteins in the reporter cell. Such a system is provided by the use of a 
very strong promoter, such as, for example, the elongation factor 1 alpha (EF- 

10 la) promoter; 3' untranslated region (3* UTR) and polyadenylation consensus 
(poly(A)) sequences from the human growth hormone gene; and the human 
IgGl hinge-CH2 intron. Other strong promoter and nonpromoter elements are 
known in the art (for example, the murine or human cytomegalovirus 
immediate early gene promoters, globin introns, and 3' UTRypoly(A) 

15 sequences). 

Cell Death Inhibition 

Another important feature of the present system is a method to 
prevent the death of cells overexpressing proteins. Such a method is important 

20 for two reasons: signal transduction intermediates themselves can lead to cell 

death if they are expressed at high levels, and, in addition, if libraries of nucleic 
acids are transfected, the presence of even a low frequency of nucleic acids 
encoding toxic proteins can interfere with detection of the desired signal. The 
latter effect can be predicted to have greater impact as the size of the library 

25 increases. 

There are se\ eral known methods to prevent the demise of cells 
undergoing programmed cell death, or apoptosis. Both viral and cellular 
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antagonists of cell death are known, and among them are species that act 
upstream in the apoptosis pathway, or at multiple levels. Examples of the 
former include dominant negative forms of cell death proteins, such as FADD 
or TRADD, and cellular inhibitors of apoptosis, such as members of the viral or 

5 cellular lAP family. Examples of proteins that act at multiple levels, or 

relatively dowoistream, include the orthopoxvirus CrmA and baculovirus p35 
proteins, members of the cellular Bel family, and both peptide and nonpeptide 
inhibitors of caspases, the cysteine proteases which act in a zymogen cascade to 
generate the death program. For the prevention of apoptosis in expression 

10 cloning, combinations of broad spectrum antiapoptotic proteins with different 
mechanisms of action are preferred, such as CrmA and Bcl-xL. 

Examples 

Identification of gene products leading to the activation of the transcription 

15 factor NF-kB 

We established a reporter cell line by stably transfecting the 
commercially-available human embryonic kidney cell line, 293 EBNA, with 
two constructs: a reporter construct consisting of NF-kB promoter elements 
upstream of the green fluorescent protein (GFP); and a eukaryotic expression 

20 construct that expresses two anti-cell death molecules, CrmA and Bcl-xL. The 
former functions as a readout for the presence of signals that activate the NF- 
kB signal transduction pathway, whereas the latter prevents these cells from 
undergoing programmed cell death in the event that pro-apoptotic signals are 
present. The expression in the reporter cell line of NF-kB-activating 

25 molecules, including receptors (e.g. tumor necrosis factor-a receptor 1 ), ligands 
(e.g. tumor necrosis factor-a) and intracellular signaling proteins (e.g. RIP), 
results in the robust production of GFP. The level of GFP production, as a 
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measure of NF-kB activation, can be assayed using a fluorescent microscope or 
flow cytometr>'. High level expression of these various NF-kB-activating 
molecules was achieved by placing their cDNAs under the control of the very 
strong EF-1 a promoter present in the commercially available plasmid PEAKS 
5 (Edge Biosystems). Thus, one can transfect into these cells cDNA libraries and 
identify individual cDNAs which are capable of NF-kB activation. 

A cDNA library, prepared from activated human T cell mRNA and 
cloned into the PEAKS vector, was subdivided into smaller libraries of 
approximately 500 cDNA clones each, and DNA was prepared from each 

10 library. The DNA from the libraries were transfected into the reporter cell line 
by calcium phosphate precipitation following published protocols, e.g., 
Ausubel et al. et al., 1997, Current Protocols in Molecular Biology?, Wiley 
Interscience. Cells were allow^ed to continue to grow for 48 hours and assayed 
for GFP production. cDNA libraries that gave a positive GFP signal, as 

15 defined by the appearance of at least 0.1% of bright cells by fluorescence 

microscopy, were screened further by sub-dividing into libraries of 50 cDNAs 
each (Fig. 2). DNA from each library was prepared, transfected into the 
reporter cell line and assayed for GFP production. The process was repeated 
until a single cDNA clone that induced a positive GFP readout was obtained. 

20 The screening procedure has resulted in the cloning of DNAs encoding known 
NF-kB-activating molecules such as surface receptors (DR3, FAS), soluble 
ligands (interleukin-1, TRAIL, CD40 ligand), intracellular signaHng molecules 
(small molecular weight GTPase rho). Also identified were a previously 
known molecule having no known function (BCMA) and novel molecules. 

25 The inclusion of two anti-cell death genes in the transfected cells are 

likely to have aided in the expression cloning of at least some of the above- 
mentioned DNAs. It has been established that expression of either FAS or DR3 
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in a cell would induce apoptosis in that cell. The fact that each was retrieved 
substantiates the improved nature of the expression cloning method of the 
present invention. Similarly, as the method of the present invention preferably 
employs high copy number, it is likely that expression of anti-cell death genes 

5 allows for cells that otherwise would have died, due to a high number of 
inserts, to survive. 

The method is more rapidly and efficiently applied by using indexed 
arrays of bacterial cultures, in which each culture is derived from a single 
colony, and, thus, represents an independently derived cDNA expression 

10 plasmid (Fig. 5). By pooling small amounts of the cultures from rows and 
columns of large arrays and preparing DNA from each of the libraries, it is 
possible to assess the relative activity of every row and every column, thereby 
reducing labor dramatically (Fig. 6). For example if there are 90,000 
individual/clones arrayed in a 300 by 300 matrix, the 300 rows and 300 

15 columns can be transfected to determine all of the rows and all of the columns 
that bear positive clones. Hence the array can be indexed in only 600 DNA 
preparations and transfections. If there is only one positive culture, the job is 
complete. If there are multiple positive cultures, then in the worst case, each 
row and each column will have only one positive culture, and if there are x 

20 positives, then there are x" possible intersections. As long as x is a small 

number, though, the work involved is quite modest. For example if x were 1 0, 
the entire array of 90,000 clones could be screened in only 700 DNA 
preparations and transfections. 

Variations of the experimental approach outlined above are 

25 applicable depending on the particular system that is to be examined. The 

reporter system and cell lines can be adapted to the promoter to be investigated. 
For instance, the NF-kB promoter in the GFP reporter construct can be replaced 
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by the interleukin-2 promoter, introduced into a T lymphocyte cell line and 
screened for molecules that regulate the signal transduction pathway leading to 
the expression of interleukin-2 in T lymphocytes. Other promoters can be 
drawn from viral sources, such as the HIV-1 long terminal repeat (LTR) 

5 promoter, or other inducible promoters of viral origin. The calcium phosphate 
transfection procedure can be changed to another method more suitable for the 
particular reporter cell type. The alternative transfection protocols could be 
based on electroporation, cationic lipids, DEAE-dextran, spheroblasts fusion or 
viral-mediated delivery. 

10 A variation to the method described above is to screen for molecules 

that will turn off expression from a promoter. For instance, the NF-kB reporter 
in the reporter cell line is activated in response to interleukin-l stimulation. By 
including interleukin-l in the culture medium, one can then search for 
molecules that will inhibit the interleukin-l dependent expression from the NF- 

15 kB promoter. Depending on the cDNA library that is used, these molecules can 
be either ones that are naturally negative regulatory or they can be mutant 
versions that behave in a dominant inhibitory manner. Similarly, screens can 
be conducted with a combinatorial library to look for small pharmacological 
molecules that will negatively interfere with the pathway (see below). 

20 The screening methods described above are well-suited for screening 

for genes from one organism that interact with a pathway in another organism. 
A prime example is screening the expressed genomes of vimses, bacteria or 
other pathogens for genes that, when expressed, might interact with the NF-kB 
pathway, This method, utilized by these pathogens to alter the immune 

25 response to their advantage, can identify potential targets for pharmacological 
interventions. 

The reporter cell may also be provided with proteins which increase 
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the efficiency of the reporter gene. These proteins arc usually introduced by 
transfection of an expression plasmid that encodes them. Proteins which 
increase the efficiency of the reporter gene may provide essential elements of a 
signaling pathway which are not otherwise present in the reporter cell, or may 
5 consist of artificial proteins that act to integrate, amplify, or selectively respond 
to signals from the pathway of interest. Of special relevance are artificial 
transcriptional activators which consist of a DNA binding element that interacts 
specifically with cognate binding sites in an artificial promoter and a pathway- 
specific transcriptional activator element that responds to activation of the 

10 pathway with a change in transcriptional activity. Such artificial activators are 
known in the art as fusions betv^^een bacterial or yeast DNA binding proteins 
and mammaHan transcription factor activation domains. Such activation 
domains provides sites for protein binding, ligand-activated conformational 
change or post-translational modification, that increase the transcription- 

15 promoting capabilities of the artificial activator. Examples of artificial 

activators that are commercially available or known in the art include LexA and 
Gal4 fusions with c-Jun, Elkl, CREB, c-Fos, ATF2, CHOP, and members of 
the nuclear hormone superfamily. 

In one example, a cell which contains (i) a recombinant anti-cell 

20 death gene, (ii) a chimeric transcription factor consisting of the Gal4 DNA 

binding domain fused to the c-Jun activation domain, and (iii) DNA encoding 
GFP operably linked to regulatory sequence consisting of a basal promoter and 
Gal4 binding sites is used to identify pol>T3eptides or compounds which 
modulate c-Jun activation. If, for example, expression of a polypeptide leads to 

25 c-Jun phosphorylation (i.e., activation), then increased GFP expression would 
result. 
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BCMA is an activator of NF-kB 

Using the methods described herein, we identified BCMA as an NF- 
kB activator. BCMA was discovered by molecular analysis of a T(4;16) 
translocation characteristic of a human T cell lymphoma (Laabi et al., EMBO J. 
5 11: 3897-3904, 1992), and its function was not previously known. 

Based on the present discovery, BCMA is a target for drug discovery 
or rational drug design. For example, a compound that modulates BCMA 
expression or biological activity will also modulate NF-kB biological activity. 
Accordingly, the invention features methods and reagents for the identification 
10 of NF-kB modulating compounds. 

BCMA poljT^eptides or nucleic acid molecules are also useful for the 
treatment of diseases associated with insufficient or inappropriate NF-kB 
biological activity or expression. BCMA polypeptides or nucleic acid 
molecules are administered to a patient using an appropriate delivery vehicle, 
15 as known in the art. Generally, the BCMA polypeptide or nucleic acid 
molecule is delivered in a pharmaceutically acceptable carrier. 

Coupled Target Identification and Assay Generation 

Directly relevant to the NF-kB signal transduction pathway is the 

20 identification of molecules that could potentially play a role in regulating 

inflammation and oncogenesis. The search for NF-kB activating molecules do 
not have to be restricted to screening cDNA expression libraries; the same 
principles can be used to identify compounds which modulate the output of the 
pathway, either by mimicking the activity of a polypeptide or by blocking its 

25 activity when the polypeptide is overexpressed. The latter compounds will act 
either upon or downstream of the overexpressed protein; once enough 
activating proteins have been identified, it will be possible to identify at what 
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step the compounds act by evaluating their action on a panel of transfected cells 
expressing different activating proteins. 

In general, compounds are identified from large libraries of both 
natural product and synthetic (or semi-synthetic) extracts or chemical libraries 

5 according to methods known in the art. Those skilled in the field of drug 
discovery and development will understand that the precise source of test 
extracts or compounds is not critical to the screening procedure(s) of the 
invention. Accordingly, virtually any number of chemical extracts or 
compounds can be screened using the methods described herein. Examples of 

10 such extracts or compounds include, but are not limited to, plant-, fungal-, 
prokaryotic- or animal-based extracts, fermentation broths, and synthetic 
compounds, as well as modification of existing compounds. Numerous 
methods are also available for generating random or directed synthesis (e.g., 
semi-synthesis or total synthesis) of any number of chemical compounds, 

15 including, but not limited to, saccharide-, lipid-, peptide-, and nucleic acid- 
based compounds. Synthetic compound libraries are commercially available 
from Brandon Associates (Merrimack, NH) and Aldrich Chemical (Milwaukee, 
WI). Alternatively, libraries of natural compounds in the form of bacterial, 
fungal, plant, and animal extracts are commercially available from a number of 

20 sources, including Biotics (Sussex, UK), Xenova (Slough, UK), Harbor Branch 
Oceangraphics Institute (Ft. Pierce, FL), and PharmaMar, U.S.A. (Cambridge, 
MA). In addition, natural and synthetically produced libraries are produced, if 
desired, according to methods known in the art, e.g., by standard extraction and 
fractionation methods. Furthermore, if desired, any library or compound is 

25 readily modified using standard chemical, physical, or biochemical methods. 
In addition, those skilled in the art of drug discovery and 
development readily understand that methods for dereplication (e.g., taxonomic 
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dereplication, biological dereplication, and chemical dereplication, or any 
combination thereof) or the elimination of rephcates or repeats of materials 
already known to modulate the test promoter should be employed whenever 
possible. 

5 When a cmde extract is found to modulate reporter gene expression, 

further fractionation of the positive lead extract is necessary to isolate chemical 
constituents responsible for the observed effect. Thus, the goal of the 
extraction, fractionation, and purification process is the careful characterization 
and identification of a chemical entity within the crude extract having the 
10 desired activity. Methods of fractionation and purification of such 

heterogenous extracts are known in the art. If desired, compounds shown to be 
useful agents for the modulation of gene expression or biological activity are 
chemically modified according to methods known in the art. 

15 Primaiy screens for compounds that modulate BCMA biological activity 
Modulating NF-kB expression or biological activity modulates 
numerous important cellular processes. The finding that NF-kB activity is 
regulated by BCMA allows us to provide assays for dmgs that modulate NF-kB 
by monitoring BCMA expression or biological activity. Such assays may 

20 measure BCMA expression by measuring changes in: (a) levels of BCMA 
protein; (b) levels of BCMA RNA; (c) levels of BCMA-mediated NF-kB 
biological activity; or (d) levels of a reporter gene or protein expressed from a 
NF-kB promoter. These measurements may be made in vitro or in vivo. These 
assays allow for the identification of compounds that modulate NF-kB 

25 biological activity (e.g., gene transcription). Such identified compounds may 
have therapeutic value, for example, in the treatment of diseases that result in 
too little or too much cell death. 
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Cells overexpressing BCMA can be produced using standard 
techniques. Compounds that are identified may bind to BCMA and prevent 
BCMA activation of NF-kB. While screening of compounds can be performed 
with cultures of primary cells, cell lines be also be used. Cell lines can be 

5 modified such that the cells constitutive] y express a BCMA polypeptide, for 
example, the BCMA intracellular domain. 

Any cell line, such as ones described above, can also be engineered 
to contain a reporter gene expressed under control of the NF-kB promoter 
(described above). A preferred reporter gene codes for GFP. Typically, the 

0 expression of the gene (e.g., the endogenous NF-kB gene or a recombinant 
reporter gene expressed under the control of the NF-kB promoter or fragment 
thereof) is measured by assaying the RNA or protein levels or both of the 
expressed gene. For example, the polypeptide expressed by the NF-kB gene or 
by the reporter gene produces a detectable signal under conditions that allow 

5 compound-mediated changes to be measured. Quantitatively determining the 
amount of signal requires comparing the amount of signal produced in the 
absence of any compound being tested to the amount produced when the cell is 
contacted with the compound, as is described herein. The comparison permits 
the identification of the compound as one that causes a change in the detectable 

0 signal produced by the expressed gene (e.g., at the RNA or protein level) and 
thus identifies a compound that is capable of modulating NF-kB expression. In 
order to prevent the NF-kB cells fi-om dying, a second gene encoding an 
apoptosis inhibitor can also be expressed in the cells, as described herein. 

5 Secondaiy screens for compounds that modulate NF-kB activity^ 

After test compounds that appear to modulate NF-kB expression are 
identified, it may be necessary or desirable to subject these compounds to 
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further testing. The invention provides such secondary confirmaTor>' assays. 
For example, a compound that appears to modulate NF-kB activity in early 
testing may be subject to additional assays to confirm that the compound also 
modulates NF-kB activity in vivo. In the first round of in vivo testing, NF-kB 
5 activity is initiated in animals by well-known methods and the compound is 
administered by one of the means described in the "Therapy" section, 
immediately below. Cells or cellular tissue are isolated within hours to days 
following the insult, and are subjected to assays to assess the level of NF-kB 
expression or biological activity. Such assays are well known to those skilled 
10 in the art. Examples of such assays include, but are not limited to, ELISAs, 
Western blot analysis, RT-PCR, RIA, and Northern blot analysis. 

Therapy 

NF-kB is an important regulator of inflammatory responses (e.g., 
15 rheumatoid arthritis, inflammatory bowel disease, septic shock), apoptosis, 
oncogenesis, and anti-viral and anti-bacterial responses. Therefore, the 
discovery of new gene products that regulate NF-kB activity, and thus the 
disease process, will result in the identification of molecular targets for 
pharmacological intervention. By increasing or mimicking BCMA biological 
20 activity, one could, for example, boost anti-tumor antibody production or 
increase T cell cytotoxicity against tumor cells in cancer immunotherapy. 
Conversely, antagonizing BCMA biological activity would be advantageous, 
for example, in situations in which it is desirable to down-regulate immune cell 
function. Compounds, identified using any of the methods disclosed herein, 
25 may be administered to patients or experimental animals with a 

pharmaceutically- acceptable diluent, carrier, or excipient, in unit dosage form. 
Conx'entional pharmaceutical practice may be employed to provide suitable 
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formulations or compositions to administer such compositions to patients or 
experimental animals. Although intravenous administration is preferred, any 
appropriate route of administration may be employed, for example, parenteral, 
subcutaneous, intramuscular, intracranial, intraorbital, ophthalmic, 

5 intraventricular, intracapsular, intraspinal, intracistemal, intraperitoneal, 

intranasal, aerosol, or oral administration. Therapeutic formulations may be in 
the form of liquid solutions or suspensions; for oral administration, 
formulations may be in the form of tablets or capsules; and for intranasal 
formulations, in the form of powders, nasal drops, or aerosols. 

10 Methods well known in the art for making formulations are found in, 

for example. Remington: The Science and Practice of Pharmacy ^ (19th ed.) ed. 
Gennaro AR., 1995, Mack Publishing Company, Easton, PA. Formulations for 
parenteral administration may, for example, contain excipients, sterile water, or 
saline, polyalkylene glycols such as polyethylene glycol, oils of vegetable 

15 origin, or hydrogenated napthalenes. Biocompatible, biodegradable lactide 

polymer, lactide/glycolide copolymer, or polyoxyethyl ene-polyoxypropyl ene 
copolymers may be used to control the release of the compounds. Other 
potentially usefiil parenteral delivery systems for antagonists or agonists of the 
invention include ethylene-vinyl acetate copol\TTier particles, osmotic pumps, 

20 implantable infusion systems, and liposomes. Formulations for inhalation may 
contain excipients, for example, lactose, or may be aqueous solutions 
containing, for example, polyoxyethylene-9-lauryl ether, glycocholate and 
deoxycholate, or may be oily solutions for administration in the form of nasal 
drops, or as a gel. 

25 As described herein, we have discovered that BCMA activates NF- 

kB activity, NF-kB, in turn, activates numerous cellular processes. Hence, any 
compound that modulates NF-kB expression is a candidate compound for use 
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in drug development. One possible compound is a polypeptide fragment of 
BCMA which maintains its ability to bind to another component of the signal 
transduction pathway but has lost its ability to activate NF-kB. Such a 
polypeptide will act as an inhibitor of wild-type NF-kB signaling . Another 
5 possible compound is a polypeptide fragment of BCMA which exhibits 
constitutive activation of NF-kB. 

Other Embodiments 

All publications and patent applications mentioned in this 

10 specification are herein incorporated by reference to the same extent as if each 
independent pubhcation or patent appHcation was specifically and individually 
indicated to be incorporated by reference. 

While the invention has been described in connection with specific 
embodiments thereof, it will be understood that it is capable of further 

15 modifications and this application is intended to cover any variations, uses, or 
adaptations of the invention following, in general, the principles of the 
invention and including such departures from the present disclosure come 
within known or customary practice within the art to which the invention 
pertains and may be applied to the essential features hereinbefore set forth. 

20 

"WHiat is claimed is: 
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1 . A method for identifying a polypeptide which increases gene 
expression from a promoter, said method comprising the steps: 

(a) contacting a library of polypeptides with a cell that expresses a 
recombinant anti-cell death gene and that comprises a reporter gene operably 
linked to said promoter, whereby expression of said reporter gene is increased 
if said library comprises a poh^peptide which increases gene expression from 
said promoter; 

(b) determining whether said reporter gene expression is increased in 
said cell as a result of contact with said polypeptide library; and 

(c) if said reporter gene expression is increased, identifying a 
polypeptide of said library which increases said reporter gene expression. 

2. The method of claim 1 , wherein step (c) comprises (i) dividing 
said library into two or more libraries with less complexity; and (ii) repeating 
steps (a) and (b) until a polypeptide which activates reporter gene expression is 
identified. 

3. A method for identifying a polj^Deptide which decreases gene 
expression from a promoter, said method comprising: 

(a) contacting a library of polypeptides with a cell that expresses a 
recombinant anti-cell death gene and that comprises a reporter gene operably 
linked to said promoter, whereby expression of said reporter gene is decreased 
if said library comprises a polypeptide which decreases gene expression from 
said promoter; 

(b) determining whether said reporter gene expression is decreased in 
said cell as a result of contact with said polypeptide library; and 

(c) if reporter gene expression is decreased, identifying a polypeptide 
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which decreases said reporter gene expression. 

4. The method of claim 3, wherein step (c) comprises (i) dividing 
said library into two or more libraries with less complexity; and (ii) repeating 
steps (a) and (b) until a polypeptide which decreases reporter gene expression is 
identified. 

5. A method for identifying a polypeptide which modulates 
activation of a transcription factor activation domain, said method comprising: 

(a) contacting a library of polypeptides with a cell that expresses a 
recombinant anti-cell death gene and a chimeric transcription factor consisting 
of a yeast or bacterial DNA binding domain fused to a mammalian transcription 
factor activation domain, and that comprises a reponer gene operably linked to 
a promoter consisting of a basal promoter and binding sites for said DNA 
binding domain, whereby expression of said reporter gene is altered if said 
library comprises a polypeptide which modulates activation of said 
transcription factor activation domain; 

(b) determining whether said reporter gene expression is altered in 
said cell as a result of contact with said polypeptide library; and 

(c) if reporter gene expression is altered, identifying a polypeptide 
which modulates said reporter gene expression. 

6. The method of claim 5, wherein step (c) comprises (i) dividing 
said library into two or more libraries with less complexity; and (ii) repeating 
steps (a) and (b) until a polypeptide which modulates reporter gene expression 
is identified. 
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7. The method of claim 1, 3, or 5, wherein said contacting comprises 
expressing a library of DNA molecules in a cell, wherein said library of DNA 
molecules encodes said library of polypeptides. 

5 8. The method of claim 7, wherein said cell is the same cell that 

expresses a recombinant anti-cell death gene and that comprises a reporter gene 
operably linked to said promoter. 

9. The method of claim 7, wherein said library of DNA molecules is 
10 introduced to said cell by transfection. 

10. The method of claim 9, wherein the mean number of said DNA 
molecules introduced by transfection into said cell is at least 25. 

15 11. The method of claim 9, wherein the mean number of said DNA 

molecules introduced by transfection into said cell is at least 100. 



12. The method of claim 9, wherein the mean number of said DNA 
molecules introduced by transfection into said cell is at least 500. 

20 

13. The method of claim 7, wherein said DNA molecules are 
expressed from a high-efficiency expression system. 



25 



14. The method of claim 1, 3, or 5, wherein said polypeptide is 
produced by the same cell that expresses a recombinant anti-cell death gene and 
that comprises a reporter gene operably linked to said promoter. 
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15. The method of claim 1, 3, or 5, wherein said pol3Apeptide is 
produced by a cell other than the cell that expresses a recombinant anti-cell 
death gene and that comprises a reporter gene operably linked to said promoter. 

5 16. The method of claim 1, 3, or 5, wherein said polypeptide is 

selected from the group consisting of an extracellular ligand, a cell surface 
receptor, and a signal transduction intermediate. 



17. A method for identifying a compound which modulates gene 
10 expression from a promoter, said method comprising: 

(a) contacting a library of compounds with a cell that expresses a 
recombinant anti-cell death gene and that comprises a reporter gene operably 
linked to said promoter, whereby expression of said reporter gene is altered if 
said library comprises a compound which modulates gene expression from said 

15 promoter; 

(b) determining whether said reporter gene expression in said cell is 
altered as a result of contact with said compound library; and 

(c) if said reporter gene expression is altered, identifying a 
compound from said library which modulates said reporter gene expression. 

20 

18. The method of claim 17, wherein step (c) comprises (i) dividing 
said library into two or more libraries with less complexity; and (ii) repeating 
steps (a) and (b) until a compound which modulates gene expression from a 
promoter is identified. 



19. The method of claim 1, 3, 5, or 17, wherein said promoter is a 
heterologous promoter. 



wo 00/50633 



PCTAJSOO/04925 



-34- 

20. The method of claim 1, 3, 5, or 17, wherein said promoter is 
derived from a mammal and said library of polypeptides comprises 
polypeptides derived from a bacterium or a vims. 



21, The method of claim 1, 3, 5, or 17, wherein said reporter gene is 

GFP. 

22, The method of claim 1, 3, 5, or 17, wherein said anti-cell death 
gene is selected from the group consisting of bcl family members, lAP family 
members, and crmA. 

23. The method of claim 1, 3, 5, or 17, wherein said cell is selected 
from the group consisting of CHO, CD-I, Cos, 293, HeLa, BHK, or L cells. 

24. A method for determining whether a compound modulates 
NF-kB biological activity, said method comprising the steps of: 

a) providing a cell expressing a BCMA polypeptide; 

b) contacting said cell with a candidate compound; and 

c) measuring the level of expression of said BCMA polypeptide in 
said cell, wherein a change in the level of expression of said BCMA 
polypeptide in said cell, relative to a cell not contacted with said candidate 
compound, identifies said candidate compound as a compound that modulates 
NF-kB biological activity. 



25. A method for determining whether a compound modulates 
BCMA biological activity, said method comprising the steps of: 
a) providing a BCMA polypeptide; 
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b) contacting said poKpeptide with a candidate compound; and 

c) measuring the level of biological activitv* of said BCMA 
polypeptide, wherein a change in the level of biological activity of said BCMA 
polypeptide, relative to a polypeptide not contacted with said candidate 
compound, identifies said candidate compound as a compound that modulates 
BCMA biological activity. 

26. The method of claim 25, wherein said BCMA polypeptide is in a 

cell. 

27. The method of claim 25, wherein said BCMA polypeptide is in a 
cell-free system. 

28. The method of claim 25, wherein said BCMA biological activity 
is the modulation of NF-kB biological activity. 

29. The method of claim 28, wherein said NF-kJB biological activity 
is the modulation of cell death. 

30. The method of claim 25, wherein said BCMA polypeptide 
comprises a polypeptide sequence having substantial identity to amino acids 
98-164 of human BCMA (SEQ ID NO: 1) or amino acids 97-163 of mouse 
BCMA (SEQ ID NO: 2). 

31. A method for determining whether a compound modulates 
NF-kB activir\\ said method comprising the steps of: 

a) providing a BCMA polypeptide; 
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b) contacting said polvT^eptide with a candidate compound; and 

c) detecting the binding of said candidate compound to said 
polypeptide, wherein a candidate compound that binds to said polypeptide is a 
compound that modulates NF-kB biological activity. 

5 

32. The method of claim 3 1 , wherein said BCMA polypeptide 
comprises a polypeptide sequence having substantial identity to amino acids 
98-164 of human BCMA (SEQ ID NO: 1) or amino acids 97-163 of mouse 
BCMA(SEQ ID NO: 2). 

10 

32. A substantially purified polypeptide comprising a polypeptide 
sequence having substantial identity to amino acids 98-164 of human BCMA 
(SEQ ID NO: 1) and not having amino acids 1-54 of human BCMA (SEQ ID 
NO: 1). 

15 

33. The polypeptide of claim 32, wherein said polypeptide 
modulates NF-kB activity. 

34. The polypeptide of claim 32, wherein said polypeptide consists 
20 of amino acids 98-164 of human BCMA (SEQ ID NO: 1). 

35. A substantially purified polypeptide comprising a polypeptide 
sequence having substantial identity to amino acids 97-163 of mouse BCMA 
(SEQ ID NO: 2) and not having amino acids 1-49 of mouse BCMA (SEQ ID 

25 NO: 2). 

36. The polypeptide of claim 35, wherein said poKpeptide 
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modulates NF-kB activity. 

37. The polypeptide of claim 35, wherein said polypeptide consists 
of amino acids 97-163 of mouse BCMA (SEQ ID NO: 2). 

5 

38. An NF-kB modulator comprising a polypeptide of claim 32 or 
35 covalently linked to a heterologous compound. 

39. The modulator of claim 38, wherein said modulator modulates 
10 NF-kB activity. 

40. A method for activating NF-kB activity in a cell, comprising 
contacting said cell with a recombinant BCMA polypeptide having NF-kB 
activating activity. 

15 

41 . A method for activating NF-kB activity in a cell, comprising 
contacting said cell with a recombinant nucleic acid molecule encoding a 
BCMA polypeptide having NF-kB activating activity. 

20 42. Use of a BCMA polypeptide for preparing a pharmaceutical 

composition for treating cancer, apoptosis, a viral infection, or an inflammatory 
response. 



25 



43. Use of a BCMA nucleic acid molecule for preparing a 
pharmaceutical composition for treating cancer, apoptosis, a viral infection, or 
an inflammatory response. 
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MLQMAGQCSQ NEYFDSLLHA 
GLSLIISLAV FVLMFLLRKI 

YTVEECTCED CIKSKPKVDS 
ISAR (SEQ ID NO: 1) 



Human BCNA 

CIPCQLRCSS NTPPLTCQRY 
SSEPLKDEFK NTGSGLLGMA 
DHCFPLPAME EGATILVTTK 



CNASVTNSVK GTNAILWTCL 
NIDLEKSRTG DEIILPRGLE 
TNDYCKSLPA ALSATEIEKS 



MAQQCFHSEY FDSLLHACKP 
LSLALFTISF LLRKMNPEAL 
TVEECTCEDC VKSKPKGDSD 
PTHTR (SEQ ID NO: 2) 



Fig. 7A 



Mouse BCNA 

CHLRCSNPPA TCQPYCDPSV 
KDEPQSPGQL DGSAQLDKAD 
HFFPLPAMEE GATILVTTKT 



TSSVKGTYTV LWIFLGLTLV 
TELTRIRAGD DRIFPRSLEY 
GDYGKSSVPT ALQSVMGMEK 



Fig. 7B 
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SEQUENCE LISTING 

<110> The General Hospital Corporation 

<12 0> METHCr: FOR CLONING SIGNAL TRANSDUCTION 
INTERMEDIATES 

'<130> 00786 '371W02 

<150> 60/121,485 
<151> 1999-02-24 

<160> 2 

<170> FastSEQ for Windov/s Version 4.0 

<210> 1 
<211> 184 
<212> PRT 

<213> Homo sapiens 
<400> 1 

Met Leu Gin Met Ala Gly Gin Cys Ser Gin Asn Glu Tyr Phe Asp Ser 

15 10 15 

Leu Leu His Ala Cys lie Pro Cys Gin Leu Arg Cys Ser Ser Asn Thr 

20 25 30 

Pro Pro Leu Thr Cys Gin Arg Tyr Cys Asn Ala Ser Val Thr Asn Ser 

35 40 45 

Val Lys Gly Thr Asn Ala lie Leu Trp Thr Cys Leu Gly Leu Ser Leu 

50 55 60 

lie lie Ser Leu Ala Val Phe Val Leu Met Phe Leu Leu Arg Lys He 
65 70 75 30 

Ser Ser Glu Pro Leu Lys Asp Glu Phe Lys Asn Thr Gly Ser Gly Leu 

85 90 95 

Leu Gly Met Ala Asn He Asp Leu Glu Lys Ser Arg Thr Gly Asp Glu 

100 105 110 

He He Leu Fro Arg Gly Leu Glu Tyr Thr Val Glu Glu Cys Thr Cys 

115 120 125 

Glu Asp Cys He Lys Ser Lys Pro Lys Val Asp Ser Asp His Cys Phe 

130 135 140 

Pro Leu Pro Ala Met Glu Glu Gly Ala Thr He Leu Val Thr Thr Lys 
145 150 155 160 

Thr Asn Asp T:--r Cys Lys Ser Leu Pro Ala Ala Leu Ser Ala Thr Glu 

165 170 175 

He Glu Lys Ser He Ser Ala Arg 
150 

<210> 2 
<211> 185 
<212> PRT 

<213> Mus musculus 



<400> 2 

Met Ala Gin Gin Cys 



Phe His Ser Glu Tyr Phe Asp Ser Leu Leu His 
SUBSTITUTE SHEET (RULE 26) 
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15 10 .5 

Ala Cys Lys Pro Cys His Leu Arg Cys Ser Asn Pro Pro Ala Thr Cys 

20 25 30 

Gin Pro Tyr Cys Asp Pro Ser Val Thr Ser Ser Val Lys Gly Thr Tyr 

35 40 45 

Thr Val Leu Trp lie Phe Leu Gly Leu Thr Leu Val Leu Ser Leu Ala 

50 55 60 

Leu Phe Thr lie Ser Phe Leu Leu Arg Lys Met Asn Pro Glu Ala Leu 
65 70 75 80 

Lys Asp Glu Pro Gin Ser Pro Gly Gin Leu Asp Gly Ser Ala Gin Leu 

85 90 95 

Asp Lys Ala Asp Thr Glu Leu Thr Arg lie Arg Ala Gly Asp Asp Arg 

100 105 110 

lie Phe Pro Arg Ser Leu Glu Tyr Thr Val Glu Glu Cys Thr Cys Glu 

115 120 125 

Asp Cys Val Lys Ser Lys Pro Lys Gly Asp Ser Asp His Phe Phe Pro 

130 135 140 

Leu Pro Ala Met Glu Glu Gly Ala Thr lie Leu Val Thr Thr Lys Thr 
145 ISO 155 160 

Gly Asp Tyr Gly Lys Ser Ser Val Pro Thr Ala Leu Gin Ser Val Met 

165 170 175 

Gly Met Glu Lys Pro Thr His Thr Arg 
180 185 
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